Gene Tery_3983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3983 
Symbol 
ID4244549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6159015 
End bp6160403 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content40% 
IMG OID638108899 
Productmalate dehydrogenase 
Protein accessionYP_723481 
Protein GI113477420 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAAAC TAACACCAAA TCCCTCATTT AGTTTAACAA TTAGCCTCGA AACTCCTAAC 
CGTACTGGAA TGTTGGCCAA GGTTACCCAA GCCATCGCAT CAGAAGGAGG CAATATAGGT
AACGTTGACC TAATAAAACA AAGTCGCCAA ATCATAATTA GAGAAATTAC AGTCGATGCC
TACAGTACAG AGCATATTGA AAAAATTGTT CAAGCTGTCA AAACCCTGCC AGAAATAAAA
CTCGTCAATG TGTATGACCG AACCTTCGAT ATACATAAAG GTGGTAAAAT CACAGTTCAA
GGCAAAATAC CTCTCAAATC CCAAGCCGAT CTATCAATGG CTTATACTCC AGGAGTAGGA
AGAATTTCTA AAGCAATTGC AGAAGACCCT CAACAAATCT ATAATTTCAC CATCAAAAAA
AATACAGTTG CTATTGTGAC AGATGGCAGT GCAGTTTTAG GACTAGGAAA TCTTGGCCCC
GGAGGAGCTT TACCAGTAAT GGAAGGTAAG GCTATGTTAT TTAAAGAATT TGCTGGAATC
GACGCTTTCC CTATTTGTCT GGCTACTCAA GATACTGATG CTATTGTCGA GACAGTTAAA
AATATTGCTC CTGTTTTTGG TGGAGTTAAC TTAGAAGATA TTGCTGCTCC TCGTTGTTTT
GAAATAGAAG TCAAGCTAAG AGAGATATTA GATATTCCAG TATTTCACGA TGACCAGCAT
GGCACAGCAA TAGTTAGTTT AGCTGCGTTA ATCAATGCTT TAAAATTGGT CAAAAAATCT
ATAGAAGATA TTCGGATCGT TATTAATGGT GCTGGTGCTG CTGGAATAGC TATCACTCGT
CTATTACAAA AAGCAGGAGC TAATCAAATT TGCTTATGTG ACTCTAAAGG TATAGTCTCC
AAAAGTCGTA CTGGTCTTAA CTCAGAAAAA CAAGCATTTG CCGTTGAGTC TTCAGGAACC
CTCGCAGATG CTTTAATGGG AGCTGATGTA TTTTTAGGTG TTAGTGTCCC GGGAGTTCTT
ACTCCAGAAA TGGTCCGTTC TATGGCCAAA GACCCTATAG TCTTTGCAAT GGCTAATCCT
ATTCCGGAAA TACAGCCAGA ATTAGTTGCA GAAGAAGTAG CAGTGATGGC CACAGGTCGG
AGTGATTATC CGAATCAAAT TAATAATGTT TTGGCATTTC CAGGCATATT TAGGGGCGCT
CTAGATTGTC GGGCTACTAC TCTCACCTCT ACAATGTATT TAGAAGCTGC TAGAGCGATC
GCTTCATTAG TCAAAACTTC AGACCTTGAC CAAGAACATA TTATTCCTTC AGTATTTGAC
ACTAGGGTAG CTACTGCGGT TGCAGGTGCA GTAGCTTCAG CAGCTCGTAC TGAAGGTGTA
ACTGTTTAA
 
Protein sequence
MVKLTPNPSF SLTISLETPN RTGMLAKVTQ AIASEGGNIG NVDLIKQSRQ IIIREITVDA 
YSTEHIEKIV QAVKTLPEIK LVNVYDRTFD IHKGGKITVQ GKIPLKSQAD LSMAYTPGVG
RISKAIAEDP QQIYNFTIKK NTVAIVTDGS AVLGLGNLGP GGALPVMEGK AMLFKEFAGI
DAFPICLATQ DTDAIVETVK NIAPVFGGVN LEDIAAPRCF EIEVKLREIL DIPVFHDDQH
GTAIVSLAAL INALKLVKKS IEDIRIVING AGAAGIAITR LLQKAGANQI CLCDSKGIVS
KSRTGLNSEK QAFAVESSGT LADALMGADV FLGVSVPGVL TPEMVRSMAK DPIVFAMANP
IPEIQPELVA EEVAVMATGR SDYPNQINNV LAFPGIFRGA LDCRATTLTS TMYLEAARAI
ASLVKTSDLD QEHIIPSVFD TRVATAVAGA VASAARTEGV TV