Gene Tery_2199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2199 
Symbol 
ID4243232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3430009 
End bp3431526 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content42% 
IMG OID638107301 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_721901 
Protein GI113475840 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.539772 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTATCAA TTAGACCTGA CGAAATTAGC CAAATTATTC GCGACCAGAT AGAACAGTAT 
GAGCAAGATG TAAAAGTATC TAATGTTGGT ACTGTTCTTC AAGTAGGTGA TGGTATTGCC
CGTGTTTATG GGTTAGACAA AGTAATGGCT GGAGAACTGG TAGAGTTCGC AGATGGTACT
GTAGGTATTG CCCAAAACCT AGAAGAGGAT AATGTAGGTG CAGTATTAAT GGGTGAAGGC
AGAGAAATCC AAGAAGGCTC TGCTGTAACC GCTACAGGTA GAATTGCCCA GGTGCCAGTA
GGAGACGCGT TGGTTGGCCG GGTGGTTGAT GGCCTTGGTC GTCCAATTGA CGGTAAAGGA
GAGATGAAAA CTACTGATAG TCGTCTACTG GAATCACCAG CTCCAGGAAT TATCGATCGC
CGTTCTGTTT ATGAACCTAT GCAAACAGGT ATTACTGCGA TTGATTCTAT GATTCCTATT
GGTAGGGGTC AACGAGAACT GATTATCGGT GATAGACAAA CAGGTAAAAC AGCGATCGCC
ATTGATACTA TTATTAACCA AAAAGGTGAA GATGTTATCT GTGTATATGT AGCTATTGGT
CAAAAAGCTT CTACAGTAGC TCAGGTAGTA GGTACTCTAG AAGAAAAAGG CGCACTAGAT
TATACGGTGA TAGTAGCAGC TAACGCTAGT GATCCAGCAA CTCTACAATA TTTAGCTCCT
TATACTGGTG CTACTATTGC CGAATACTTC ATGTACAAAG GCAAAGCAAC TTTGGTAATC
TACGATGACC TTTCCAAGCA AGCTCAAGCT TATCGTCAGG TATCTTTGCT ATTACGTCGC
CCACCAGGTC GGGAAGCATA TCCAGGAGAT GTGTTTTATC TCCACTCCCG TCTTCTAGAA
AGAGCTGCTA AACTCAATGA CAAACTTGGT GGAGGTAGTA TGACTGCTCT ACCAATTATT
GAAACTCAAG CAGGTGACGT TTCAGCTTAC ATCCCTACTA ACGTAATTTC CATTACTGAT
GGTCAAATAT TCCTGTCTAG TGACTTATTT AATGCAGGTT TCCGGCCGGC GGTGAATGCT
GGTATTTCTG TATCTAGGGT TGGTTCTGCG GCTCAAACTA AAGCTATCAA AAAAGTTGCT
GGTAAAATTA AGTTAGAGCT AGCTCAGTTT GCTGAATTAG AAGCATTCTC CCAGTTTGCT
TCTGACCTGG ATAAGGCTAC TCAAAACCAA CTGGCACGGG GTCAGCGTTT GCGGGAGATT
TTGAAGCAAC CTCAAAATTC ACCTAGGTCA CTTCCTGAGC AGGTAGCAGC AATTTATTCT
GGTATTAATG GTTACTTAGA TGATATTCCT CTGGAAAAAG CGGCTAAGTT TATTGCTGGT
TTACTAGATT ACTTAAACAA TAGTAAGCCT AAGTTTGGTG AAATCATTAA AACAGAAAAG
GTACTTACTG ATGAAGCACA AACTCTCTTA AAAGAGGGGA TCACTGAGTA TAAGCAGACA
TTCTTGGTAT CAGCTTAG
 
Protein sequence
MVSIRPDEIS QIIRDQIEQY EQDVKVSNVG TVLQVGDGIA RVYGLDKVMA GELVEFADGT 
VGIAQNLEED NVGAVLMGEG REIQEGSAVT ATGRIAQVPV GDALVGRVVD GLGRPIDGKG
EMKTTDSRLL ESPAPGIIDR RSVYEPMQTG ITAIDSMIPI GRGQRELIIG DRQTGKTAIA
IDTIINQKGE DVICVYVAIG QKASTVAQVV GTLEEKGALD YTVIVAANAS DPATLQYLAP
YTGATIAEYF MYKGKATLVI YDDLSKQAQA YRQVSLLLRR PPGREAYPGD VFYLHSRLLE
RAAKLNDKLG GGSMTALPII ETQAGDVSAY IPTNVISITD GQIFLSSDLF NAGFRPAVNA
GISVSRVGSA AQTKAIKKVA GKIKLELAQF AELEAFSQFA SDLDKATQNQ LARGQRLREI
LKQPQNSPRS LPEQVAAIYS GINGYLDDIP LEKAAKFIAG LLDYLNNSKP KFGEIIKTEK
VLTDEAQTLL KEGITEYKQT FLVSA