Gene Tery_4031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4031 
Symbol 
ID4242059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6232970 
End bp6233995 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content40% 
IMG OID638108941 
Productthiamine monophosphate kinase 
Protein accessionYP_723522 
Protein GI113477461 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0483109 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.163245 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAAG TTAAAGATAT TAGCGAACAA GAGCTTTTAA AAAAATTGCG CCATTTTTGT 
CCAGAAAATA TTATTGGGGA TGATGCCGCC ATTTTGCCAA CTCAACCCAA CAAGTCATTA
GTGGTGACAA CTGATACACT TGTTGATGGA GTACATTTTA GTGATCGCAC AACCTCTCCT
GAAGATGTAG GTTGGCGTAG TGCTGCTGCT AACTTATCCG ATATCGCAGC AATGGGAGCG
TTTCCCATTG GAATTACGAT CGCTCTTGGT TTGACTGTGG ATACACCCAT TGCTTGGTGC
GAACAACTCT ATCAAGGTAT TTCTGAATGC TTGCAGCCAT ACAATACTCC TATAGTTGGT
GGAGATATTG TGCGATCCCC AGTTTCTACG ATCTCAATTA CTGCTTTTGG ACAAATAGAT
GGCGGATATA CCATTCTTCG CTCAAATTGT CAACCAAAAA ATGTAATTGT TGCCACTGGA
GTTCATGGAG CATCAAGGGC TGGCTTAGAA TTATTACTTT ATCCAGAACT AGGTGAATCT
CTCACAGAAC TCGAACGTTC TGACCTAATT TTGGCTCACC AAAGACCTAA ACCAAGGCTT
GACATTTTAG ATACTTTATG GGATTGTCTT TCCTTAAATA GTTTCAGTGC TACAAAAGGT
AGCAAAAACT CTATTTTAGT AGGAGGAATG GATAGTAGTG ATGGCCTGGC AGATGCAATT
ATTCAGATTT GTCAAGCATC CCAAATAGGG GCCAGGATAG AACGTAATCA AATTCCTATT
CCTCACTCCT TGACTAAATT CGTCTCAGAT GAACAAGCAT TAAACTGGGC ACTATATGGA
GGAGAAGATT TCGAGTTAGT TTTGTGCCTT CCACCAAATT GTGGCCAAGA ATTAGTAAAA
AGGATTGGAG TCGGTGCAGC CATAGTGGGT ACAACCACAA ACAACACCGA CATCTTATTA
ATAGATAGCC AAGGAATATA CCCAAATGAG CAACTTACAC TTAGCCAAGG ATTTCAACAT
TTTTGA
 
Protein sequence
MLKVKDISEQ ELLKKLRHFC PENIIGDDAA ILPTQPNKSL VVTTDTLVDG VHFSDRTTSP 
EDVGWRSAAA NLSDIAAMGA FPIGITIALG LTVDTPIAWC EQLYQGISEC LQPYNTPIVG
GDIVRSPVST ISITAFGQID GGYTILRSNC QPKNVIVATG VHGASRAGLE LLLYPELGES
LTELERSDLI LAHQRPKPRL DILDTLWDCL SLNSFSATKG SKNSILVGGM DSSDGLADAI
IQICQASQIG ARIERNQIPI PHSLTKFVSD EQALNWALYG GEDFELVLCL PPNCGQELVK
RIGVGAAIVG TTTNNTDILL IDSQGIYPNE QLTLSQGFQH F