Gene Tery_0214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0214 
Symbol 
ID4241810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp325107 
End bp326174 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content34% 
IMG OID638105559 
ProductGHMP kinase 
Protein accessionYP_720176 
Protein GI113474115 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0700927 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.111663 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTT TTGTACCAGG TCGCCTTTGT TTATTTGGAG AACATAGTGA TTGGGCAGGA 
GGGTATCGTT CTATGAACCC CCAAATAGAT AAAGGCTATA CAATTGTAGT AGGAGTTGAT
CAAGGAATTT ATGCAGATGT CAAACCCCAT CCTACTCATT TAATTATCAA GACAACTTTG
AATAATGAAA GTCTGATACA GTCGTTTAAT TTGCATAAGC ATAAAGAAAT ATTATTAGCA
GAAGCTCGTC GGGGAGGAGT TTTTAGTTAT GTAGCTGGGG TTGTATATCA AGTTATTAAT
AATTATTCAG TAGGTGGTTT AGAAATAGAT AATTATTTCA CAGATTTACC AATTAAAAAG
GGTCTATCAT CAAGCGCGGC TATTTGTGTA TTGGTGGCTA GAGCTTTTAA CCTATTGTAT
GATTTAAAAC TAAATATTCG TTCGGAAATG GAGTTAGCTT ATCAGGGGGA AGTTACTACT
CCTTCTCGTT GTGGCAAGAT GGACCAAGCT TGTGCTTATG GTAAGCAAGC AATTATGATG
ATATTTGACG GGGAAAAAAC GGATATTATT GAACTTAATC CACCAAAAAA AGATTTATTC
TTTGTTATTG TTGATCTAGG CGCGAGCAAA GATACTCAAG AGATATTGAC TAGATTAAAT
AAATGTTATA TAGAAGAAGC TAATAAAGTA AATGAGAATG TACAATATTA TCTTGGAGTT
ATTAATGCTG ATATTACTAA ACAAGCAGCA TTAGCCTGGC AAAAAGGGGA TGGAGAAAAA
ATTGGTAGTT TGATGCTCAA AGCACAAATT GAATTTGATA AATATATGAT ACCAGCTTGT
CCTTCACAAT TAACATCTCC AGTACTCCAT TTATTACTAA ATTATTCACG TCTCCAAGAA
TATATTTGGG GTGGTAAAGG AGTTGGTTCT CAAGGTGATG GAACAGCTCA ATTCATTGCT
AAAGATGAGA ATAGTCAACA GAAGTTAATT GAAATAATTA ACCTAGATTT TCCTAAAATG
CAATGTTTTA AATTAGTAGT TAGGGCTGAA TATAGCAATT CTAAATGA
 
Protein sequence
MKLFVPGRLC LFGEHSDWAG GYRSMNPQID KGYTIVVGVD QGIYADVKPH PTHLIIKTTL 
NNESLIQSFN LHKHKEILLA EARRGGVFSY VAGVVYQVIN NYSVGGLEID NYFTDLPIKK
GLSSSAAICV LVARAFNLLY DLKLNIRSEM ELAYQGEVTT PSRCGKMDQA CAYGKQAIMM
IFDGEKTDII ELNPPKKDLF FVIVDLGASK DTQEILTRLN KCYIEEANKV NENVQYYLGV
INADITKQAA LAWQKGDGEK IGSLMLKAQI EFDKYMIPAC PSQLTSPVLH LLLNYSRLQE
YIWGGKGVGS QGDGTAQFIA KDENSQQKLI EIINLDFPKM QCFKLVVRAE YSNSK