Gene Tery_2056 details


Gene Information

Locus tag: Tery_2056
Symbol:
ID: 4245704
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 3207590
End bp: 3209005
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 34%
IMG OID: 638107167
Product: protein of unknown function DUF224, cysteine-rich region
Protein accession: YP_721770
Protein GI: 113475709
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
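
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
START_BP = 3207590
END_BP = 3209005


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1416
print(protein_length(length_bp))  # 471
```

Both results agree with the listed gene length (1416 bp) and protein length (471 aa).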


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCTCA AAGACATAAG TATTCAAAGT AATCCAACAA TTCCAGCGAA AGATAGCAAA 
TTTTTACAGC AAAATCCACA CTTGCAATCT GACTTAGATC ACTCTAGTTT TGACTCTAAA
AATCCACCAT CACCAGAATT AATTGGTGCC TGCGTCCACT GTGGATTTTG TTTATCAACA
TGTCCTAGTT ATCGAGTAAT AGGCAAAGAA ATGGATTCTC CCAGGGGCAG AATTTATGTT
ATGGATGCTA TTAACAATGG AGAAGCATCT TTAAATCAAA CAAGTTCTCA ACATTTTGAT
ACTTGCTTAG GTTGTCTCGC TTGTGTAACT ACTTGCCCTT CAGGAGTTAG ATATGATAAA
TTAATTGCTG CAACTCGCCC ACAAGTAGAA CGAAACATTC CCCGTTCATT ACCTGATAAA
TTAATTCGTA GTCTGATATT TAATTTATTT CCTTATCCTA ATAGATTACG ACCTTTACTT
ATTCCTTTAT TTATTTATCA AAAATTAGGA TTTAACAAAC TAATTCGTAG TAGCAAATTA
CTTGATAAAA TATCTCCTAG ATTAGCAGCA ATGGAATCTA TTCTACCAGA AATTACAGTT
GATTCTTTCT CTAATAATTA TCCAAATATT ATTCCGGCTG AAGGAGAAAA ACGTTATCGA
GTTGGTTTGA TTTTAGGTTG TGTTCAAAGA ATATTTTTCT CCTCTGTTAA TATGGCAACA
ATTCGAGTTT TAACTGCTAA TGGTTGTGAA GTTGTGATTC CTAAAAGTCA AGGTTGTTGT
GCTGCATTGC CAGAACATCA AGGACAAACC GAACAAGCTC ATGCTTTGGC AAAACAAATG
ATCGATAGTT TTGTAAATAC AGGAGTTGAT GCAGTTATTA TTAATGCTGC TGGTTGTGGT
CACACTCTTA AAGAATACGA TAATATTCTA CAGGATGATT CTGAGTATTG TGACAAGGCA
AAAGAATTTT CTAATAAAGT TAAAGATGTG CAAGAATTCT TAGCAAATGT AGGATTGACA
GCTAAACTTT ATCCTTTGGT TGAGGAAGAA GAATTGACTA TAGTTTATCA AGATGCTTGC
CATTTGTTGC ACGGTCAAAA AATTAGTTTA GAACCTAGAA AATTGCTGCT AAAAATTCCT
GGGGTGAAGT TGCGTGAACC TATAGATGCA GCTTTATGTT GTGGAAGTGC GGGAGTCTAT
AATATGCTAC AACCGGAAAC AGCTAATGAA TTAGGAGAAC AAAAAGTAGA AAACTTATTG
AATACAGGTG CAGAATTAAT TGCTTCTCCT AATCCTGGAT GTTCTTTACA AATTAAAAAG
CATTTAGATT TGCAAGGTAA TAATATGAGT TTAATGCACC CAATAGAATT ATTAGATTAT
TCAATTCGGG AGGTAAAGTT AAATCTAAAA AAGTAA
 
Protein sequence
MDLKDISIQS NPTIPAKDSK FLQQNPHLQS DLDHSSFDSK NPPSPELIGA CVHCGFCLST 
CPSYRVIGKE MDSPRGRIYV MDAINNGEAS LNQTSSQHFD TCLGCLACVT TCPSGVRYDK
LIAATRPQVE RNIPRSLPDK LIRSLIFNLF PYPNRLRPLL IPLFIYQKLG FNKLIRSSKL
LDKISPRLAA MESILPEITV DSFSNNYPNI IPAEGEKRYR VGLILGCVQR IFFSSVNMAT
IRVLTANGCE VVIPKSQGCC AALPEHQGQT EQAHALAKQM IDSFVNTGVD AVIINAAGCG
HTLKEYDNIL QDDSEYCDKA KEFSNKVKDV QEFLANVGLT AKLYPLVEEE ELTIVYQDAC
HLLHGQKISL EPRKLLLKIP GVKLREPIDA ALCCGSAGVY NMLQPETANE LGEQKVENLL
NTGAELIASP NPGCSLQIKK HLDLQGNNMS LMHPIELLDY SIREVKLNLK K
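
The gene and protein sequences above are printed in blocks of ten characters. As a rough sketch of how the two relate, the snippet below strips that formatting, computes GC content, and translates codons until the first stop. It assumes the standard amino-acid assignments, which translation table 11 shares; it does not handle the alternative start codons that table 11 permits. The helper names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks; upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGATCTCA AAGACATAAG TATTCAAAGT"
print(translate(first_blocks))  # MDLKDISIQS
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` allows the same comparison against the listed GC content and the complete protein.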