Gene Tery_2093 details


Gene Information

Locus tag: Tery_2093
Symbol:
ID: 4243927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 3267818
End bp: 3268873
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 38%
IMG OID: 638107202
Product: pentapeptide repeat-containing protein
Protein accession: YP_721805
Protein GI: 113475744
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
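
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3267818, 3268873)
print(length_bp)                  # 1056
print(protein_length(length_bp))  # 351
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.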


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.391868
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCTA GCCAGCTTTT AAGTGCAAAT GAACTGTTAT TTAGACATTC CCAAGGAGAA 
AGAAATTTTC AGGGTGCAAA CCTAATTGCC GTTAATTTAA GTGCAGTCAA CCTCAATTGT
AGCAATCTAA GTAATGCTAA TTTTAAAGAT TCATACTTAG GTAAAACAAA ACTAATTGGA
TCTAATTTAA ATGGTGCAGA TTTTAGTTAT GCTAATCTGT CCGAAGCCAA ATTTATAGAA
GCTAATTTGA GTGCTGCTAA CTTTACTAAA ACCACACTTA TTGCAACCGA TATAAGTGGA
GGAATTTTGA GTGGGGCAAT TTTCTCAGAA GCTAATTTAA CAAGAGCTAT TCTCATTGGT
ACTAGCATGG TTGGAACTTC TTTACTAAAT TGCTCAATAT TAACCAAAGC TAACCTCACA
AGAGCTACTC TTTCTCGTGC TATTCTTAGT GGTGCTGACT TAACACAGGC TAACTTGAAT
CGGGCAATTA TGACTGAAGT GGACCTAAGT GGCACTTTGC TAAATCAGGC AAGTCTAATT
CGAGCCTATC TACAACGGGG TAATCTCAAT GGTGCAAAAC TAATCAAGGC AGATTTAACA
GAAGCCACTT TAGTACAGGC CAACCTTTGT GCTTCTGATT TAACTGGAGC AGAGTTGCAA
GGTGCAAATC TCAGTTATGC TAATTTAAGT GGGTCAAATT TGATGGGAGC GAATCTACAG
GGAGCAAATC TCAGCAATAC TAATCTTAAT GGTGTTATTC TCCAACAGGC AGACCTGCAA
GCTGCTGACT TGAGCAAAGC TAGCTTACGA GGTGCTAATT TAAAAGCTGT TAATCTCTCA
GGGGCAAATT TATTGAAAGC TGACTTGCGC GATACTAACT TACAAAAGGC TAATCTTTAT
GGCGCTGGTT TATTGTTAGT ATCTCTCAAA GGCGCCAACT TAAAAGAAGC CTGTTTATGT
AATGCTAACT TAATTGGGTC TAGTTTAAAT CTTTCTAGTC TTCAGGATGT TTGCCTAGAA
AAAACAATTA TGCCTAATGG TTCAATTCAT GAATAG
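
GC content is the percentage of G and C bases in a sequence. A minimal sketch for recomputing it from the gene sequence above, assuming the spaces and line breaks between 10-base blocks are display formatting only; the function name gc_percent is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence (all lines) into the string.
first_line = "ATGAATGCTA GCCAGCTTTT AAGTGCAAAT GAACTGTTAT TTAGACATTC CCAAGGAGAA"
print(round(gc_percent(first_line), 1))
```

Running it on the complete sequence, rather than on the first line shown here, is what would be comparable to the GC content field of the record.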
 
Protein sequence
MNASQLLSAN ELLFRHSQGE RNFQGANLIA VNLSAVNLNC SNLSNANFKD SYLGKTKLIG 
SNLNGADFSY ANLSEAKFIE ANLSAANFTK TTLIATDISG GILSGAIFSE ANLTRAILIG
TSMVGTSLLN CSILTKANLT RATLSRAILS GADLTQANLN RAIMTEVDLS GTLLNQASLI
RAYLQRGNLN GAKLIKADLT EATLVQANLC ASDLTGAELQ GANLSYANLS GSNLMGANLQ
GANLSNTNLN GVILQQADLQ AADLSKASLR GANLKAVNLS GANLLKADLR DTNLQKANLY
GAGLLLVSLK GANLKEACLC NANLIGSSLN LSSLQDVCLE KTIMPNGSIH E
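
The protein sequence should follow from the gene sequence by codon-by-codon translation. The sketch below is one way to check that, not the database's own method. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in the start codons it permits), and the names CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons of the gene sequence above.
print(translate("ATGAATGCTA GCCAGCTTTT AAGTGCAAAT"))  # MNASQLLSAN
```

Passing the full gene sequence to translate and comparing the result with the protein sequence (after removing spaces and line breaks) gives a complete consistency check.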