Gene Tery_3369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3369 
Symbol 
ID4243464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5165518 
End bp5166480 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content41% 
IMG OID638108354 
ProductNADH ubiquinone oxidoreductase, 20 kDa subunit 
Protein accessionYP_722944 
Protein GI113476883 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.626473 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAATC TACTGTGGCT TCAAGGTGGA GCTTGTTCTG GCAATACCAT ATCATTTCTC 
AATGCTGAAG AACCAAATAT TTGTGATTTA ATTACAGACT TCGGCATTAA TGTACTTTGG
CATCCTTCCC TAGGAATAGA ACTTGGTACA AATGTTCAAC AATTATTAAA AGATTGTATT
TCCGGAAAAA TACAATTAGA TATTTTAGTA TACGAAGGAA CAGTAATTAA CGCACCAAAT
GGTATGGGAG ATTGGAACCG TTTTGCCGAT CGGCCAATGA AAAACTGGGT AAAAGAATTA
GCAGAAATTG CCAGTTTCGT TGTTGCGGTA GGAGACTGCG CTACTTATGG CGGTATTCCG
GCAATGGCAC CAAACCCCAG TGAGTCTGAA GGTTTACAAT TTCTGAGACG GAAAAAAGGT
GGATTTTTAG GAGAAGACTT CAAAAGCCAA GCTGGATATC CTGTGATCAA CATTCCTGGT
TGCCCTGCAC ACCCAGATTG GATTTCGCAA ATATTGGTAG CAGTAGCCAC TGGCAGACTT
AATGACATTA CCTTAGACGA ATTTCACCGC CCTGAAACAT TTTTCAAAAG CTTCACTCAA
ACAGGTTGTA CTCGCAACGT TCACTTTGCT TACAAAGCTT CAACCTCAGA TTTTGGGCAA
CGTCAAGGAT GTCTATTTTA TGATTTAGGT TGTCGCGGAC CGATGACTCG TTCTTCCTGT
AACCGCATTT TGTGGAACCG AGTTTCTTCT AAAACTCGAG CGGGGATGCC ATGTTTAGGT
TGCACGGAAC CAGAATTTCC ATTCCAGGAT CTTATGCCTG GAACAGTATT TACAACTCAA
ACAGTTATGG GAGTGCCTAA AGAATTACCA ACAGGAGTTA ACCGTAAAGA TTATGCTGTA
TTGACTATGG TTGCTAAAAA TTCAACGCCA GCATGGGCAG AGGAAGATTT CTTTACAGTT
TAA
 
Protein sequence
MINLLWLQGG ACSGNTISFL NAEEPNICDL ITDFGINVLW HPSLGIELGT NVQQLLKDCI 
SGKIQLDILV YEGTVINAPN GMGDWNRFAD RPMKNWVKEL AEIASFVVAV GDCATYGGIP
AMAPNPSESE GLQFLRRKKG GFLGEDFKSQ AGYPVINIPG CPAHPDWISQ ILVAVATGRL
NDITLDEFHR PETFFKSFTQ TGCTRNVHFA YKASTSDFGQ RQGCLFYDLG CRGPMTRSSC
NRILWNRVSS KTRAGMPCLG CTEPEFPFQD LMPGTVFTTQ TVMGVPKELP TGVNRKDYAV
LTMVAKNSTP AWAEEDFFTV