Gene Tery_1108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1108 
Symbol 
ID4242189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1743770 
End bp1744945 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content39% 
IMG OID638106333 
Producthypothetical protein 
Protein accessionYP_720945 
Protein GI113474884 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
[COG0607] Rhodanese-related sulfurtransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000685928 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.228939 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAATC CTAATCTGGA ACAAATCCAG TTAAATACAG AAGAATATCA ACGTTATTCG 
AGGCACCTGA TTCTACCGGA AGTAGGATTA GATGGTCAAA AACGTCTCAA GGCAGCTAGT
GTTCTATGTA TAGGCACGGG AGGTCTTGGT TCTCCACTAT TGTTATATCT AGCAGCAGCA
GGAATTGGAA ATATTGGAAT TGTAGATTTT GATATTGTCG ATAGTTCCAA TTTACAACGA
CAGGTTATTC ATGGTACTTC CTGGGTGGGT AAGCCAAAAA TTGAATCTGC TAAAAATCGG
ATTCATGAAA TTAATCCTTA CTGTCAGGTT GACCTTTATG AAACCAGGTT AAGTGCTGAA
AATGCCCTTG ACATTCTCAA GTCTTATGAT GTGATTGTTG ATGGTACTGA TAATTTCCCG
ACTCGTTATT TGGTTAATGA TGCCTGTGTT CTTTTGAATA AACCTAATGT CTACGGCTCA
ATTTTCCGCT TTGAGGGTCA GGCAACTGTG TTTAATTATG AAGGTGGACC GAACTACCGT
GACCTTTACC CTGAACCTCC ACCCCCAGGA ATGGTACCTT CTTGTGCAGA AGGTGGGGTG
TTGGGTATTT TACCAGGAAT AATTGGGGTG ATCCAAGCAA CGGAAACTAT CAAAGTTGTT
TTGGGTAAAG GTAAGACTTT GAGTGGTAGA TTGTTACTTT ATAATTCCCT AGATATGACT
TTCCGAGAAT TGAAATTGCG TCCTAATCCG ATACGACCAA TTATTGAAGA GTTGATTGAT
TATGAGCAGT TTTGTGGTAT TCCTCAAGCT AAAGCACAGG AGGCAGAAAC TAAAATGGCT
ATTCCAGAAA TGACAGTTCA AGATTTGAAG CAATTATTTG ATAGTGGGAA GAAGGATGAT
TTTGTTTTAG TTGATGTACG GAACCCCAAT GAATATGATA TTGCCAAAAT TCCTGGGTCT
GTTTTAGTAC CATTGCCAGA TATTGAGCAG GGCCCTGGTG TGACAAAGGT GAAGGAGTTA
ATGAATAATC GCTCTTTAAT TGCTCATTGT AAGATGGGGG GGAGATCGGC TAAAGCTTTA
GGTATTCTTA AAGAACATGG TATTGAGGGT ACTAATCTCA AGGGTGGAAT TACTGCTTGG
AGTAAGGAAA TAGATTCTTC TGTACCTCAA TATTAA
 
Protein sequence
MLNPNLEQIQ LNTEEYQRYS RHLILPEVGL DGQKRLKAAS VLCIGTGGLG SPLLLYLAAA 
GIGNIGIVDF DIVDSSNLQR QVIHGTSWVG KPKIESAKNR IHEINPYCQV DLYETRLSAE
NALDILKSYD VIVDGTDNFP TRYLVNDACV LLNKPNVYGS IFRFEGQATV FNYEGGPNYR
DLYPEPPPPG MVPSCAEGGV LGILPGIIGV IQATETIKVV LGKGKTLSGR LLLYNSLDMT
FRELKLRPNP IRPIIEELID YEQFCGIPQA KAQEAETKMA IPEMTVQDLK QLFDSGKKDD
FVLVDVRNPN EYDIAKIPGS VLVPLPDIEQ GPGVTKVKEL MNNRSLIAHC KMGGRSAKAL
GILKEHGIEG TNLKGGITAW SKEIDSSVPQ Y