Gene Tery_1914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1914 
Symbol 
ID4242663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2963591 
End bp2964661 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content39% 
IMG OID638107035 
Productoxidoreductase-like 
Protein accessionYP_721642 
Protein GI113475581 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00346071 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.174579 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACAC TATATTCAAC TGATTTTACC AAAAATAGTC AAGCAAAACT ACTTCGTGTA 
GGGGTAATTG GGGTCGGTAA TATGGGGCAG CATCATACCC GTGTCCTAAG CTTGCTCAAA
GATACTGAAT TAGTAGGTGT AGCAGATATC AATGTTGAAA GGGGACTAGA TACTGCCAGT
AAGTATGGAG TTCGTTTTTT TGAGGAGTAT CAAGAGCTTT TGCCTCATGT TGATGCTGTC
TGTGTCGCTG TGCCCACTCT TCAACATTAT GCTGTAGGAA TGACCTGCTT AAAGGCGGGA
GTTCATGTAC TAATGGAAAA ACCCATCGCT GCCAGTATTG GTGAGGCGGA GTTTTTGGTT
AATACTGCTG CTGAAACAAA TAGAATTTTA CAGGTGGGTC ATATAGAAAG GTTTAATCCT
GCTTTTCAAG AACTCAGTAA AGTTCTGAAA ACGGAAGAAT TACTGGCCTT GGAAGCTCAT
CGGATGAGTC CTTATTCTCA TAGGGCTAAT GATGTGTCTG TAGTGTTGGA CTTGATGATT
CATGATATTG ATTTATTACT AGAGTTGGTG GCTGCCCCAG TTGTCAAACT GACTGCTAGT
GGTAGTAGTG CTTCTAGTTC GGGAAATTTA GATTACGTTA CGGCTACTCT TGGCTTTGCT
AATGGTATTG TGGCTACTTT AACGGCCAGT AAGGTGACAC ATAAAAAACT TCGTTCCATC
GTTGCCCACT GTAAAAATTC TTTGACAGAG GCAGATTTTC TCAACAATGA AATTCTGATT
CACCGTCAAA CAACAGGAGA TTATGTGACT GACTATGGTC AGGTTCTTTA TCGTCAGGAT
GGTTTAATTG AAAGGGTTTA TACTAGCAAT ATTGAACCTC TTCATGCAGA ATTAGAGCAT
TTTGTTTATT GTGTAGGTGG AGGAAATAAA CCTTCAGTTG GAGGAGAACA AGCTCTTAAG
GCATTAAGAT TAGGTAGTTT GATTGAGAAA ATAGCTATTG ATGATTTGGC TTACCATCCA
GTAGATTCAG AATTTAATTT GTTGAATAAT TCTATGCTGA AAGTGGGGTA A
 
Protein sequence
MSTLYSTDFT KNSQAKLLRV GVIGVGNMGQ HHTRVLSLLK DTELVGVADI NVERGLDTAS 
KYGVRFFEEY QELLPHVDAV CVAVPTLQHY AVGMTCLKAG VHVLMEKPIA ASIGEAEFLV
NTAAETNRIL QVGHIERFNP AFQELSKVLK TEELLALEAH RMSPYSHRAN DVSVVLDLMI
HDIDLLLELV AAPVVKLTAS GSSASSSGNL DYVTATLGFA NGIVATLTAS KVTHKKLRSI
VAHCKNSLTE ADFLNNEILI HRQTTGDYVT DYGQVLYRQD GLIERVYTSN IEPLHAELEH
FVYCVGGGNK PSVGGEQALK ALRLGSLIEK IAIDDLAYHP VDSEFNLLNN SMLKVG