Gene Moth_2168 details


Gene Information

Locus tag: Moth_2168
Symbol:
ID: 3833017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2268244
End bp: 2269836
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 64%
IMG OID: 637830090
Product: hypothetical protein
Protein accession: YP_431000
Protein GI: 83590991
COG category: [G] Carbohydrate transport and metabolism; [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region
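
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2268244, 2269836)
print(length_bp)                  # 1593
print(protein_length(length_bp))  # 530
```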


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00013257
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000197316
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGTACCTGG TAACGGCAGC AGAGATGGGA CAGCTGGATC GCCTGGCGTC CAGCGAGTAC 
ATGATACCCA GTATCGTCTT AATGGAAAAC GCCGGCTTGC GGGTGGTAGA ATCCATCGAG
CGCCACTTTC AGGGCCAGGT AGCCAACCGC CGGATTTTAA TCTTCTGTGG TAAAGGCAAC
AATGGCGGCG ACGGCCTGGT GGTCGCCCGC CATCTCCTGA ACCGGGGGGC CGAGGTCAAG
GTCTTTCTTC TGGCCCGGCC GGAGGATATA AGGGGCGACG CCAGGACCAA CCTGGAGATT
TACCAGAAAA TGGGCGGCAA GCTGCTGTTG CTCCTCGGGG AGAGCCACCT GCAGCGGGCC
GACATCGCCC TGCTCTATGC CGACCTGGTG GTGGACGCCA TCTTTGGTAC GGGCTTTAAA
GGGGCGGCCA TGGGGCTGCC GGCCGCCGTC ATTAATATGA TCAATAAAGC CCACCGGGAG
ACGGTGGCCG TGGACCTGCC CTCCGGGCTG GAGGCGGATA CGGGGCGTTG CTTCGGACCC
TGCATCCAGG CCACCTGGAC GGTTACCTTC GCCCTGCCTA AACTCGGCCT GGTCGTCGAG
CCAGGAGCCA GCCTGACCGG CCGCCTGGAG GTAGCCGATA TCGGCATTCC CCAGAAACTC
GTAGCCACCC AGCATTTTAA CCGGCGGCTC CTGACGGCCG CCTGGTGCCG CTCCCAGTTG
CCACGTCGGG AGGCCAGCGG CCACAAGGGT TTATATGGTA GGGTCCTGGC GGTGGGCGGT
TCACCGGGTC TTACCGGCGC TATTACCCTG GCGGCTACGG CCGCTTTAAA GGCCGGGGCC
GGCCTGGTAA CGGCTGCCGT CCCCCGGGGG GTTCAGGGTA TCCTGGCCAT GAAAACTACC
GAGATCATGA CCATGTCCCT GCCGGAGACG CCGGCGGGGG CCTTAAGCCG TGACGCCCTG
GACCCGCTCC TGGAGCGCCT GGCAGAAGTC GACGTCCTGG CCATCGGCCC GGGCCTTTCC
CGGGACCCGG CTACGGTAGA CCTGGTAAAA GAGTTGCTTC CCCGGGTACA GGTGCCGGCG
GTGGTAGACG CCGATGCCCT GAACGCCCTG GCGACAGATA CGAGGGTCCT GACCGGCGAT
CATGGCCCCC TGGTCCTGAC CCCGCACCCC GGAGAAATGG CCCGCCTGCT GGGAACTACC
GCCGCCAAGA TCCAGGAAGA CCGCCTGGAG ATAGCCGCCA AGTACGCCCG GGAATGGCAG
GCGGTCCTGC TGTTGAAGGG TGCCCGGACA GTTATTGCCT GGCCGGACGG GCAGGTATAT
ATCAATCCTA CCGGTAACCC CGGCATGGCT ACCGCCGGCA GCGGCGATGT ATTGACAGGG
ATTATTGCCG GGCTTGCAGG TCAGGGGCTT AAGCCCGGGG TGGCTGCCGC CCTGGGAGCC
TATCTCCACG GGGCGGCCGG GGATGAAGCA GCCAGGCAGC GGGGCCAGCG GGCCATGATG
GCCGGGGATC TGTTGGACTT TTTGCCATAC GTCTTGCGTA ACCTGGAGGA GGAGGTAGAG
ACTATTGTCG CGGCCGGTTT GGGCCGAGAT TGA
 
Protein sequence
MYLVTAAEMG QLDRLASSEY MIPSIVLMEN AGLRVVESIE RHFQGQVANR RILIFCGKGN 
NGGDGLVVAR HLLNRGAEVK VFLLARPEDI RGDARTNLEI YQKMGGKLLL LLGESHLQRA
DIALLYADLV VDAIFGTGFK GAAMGLPAAV INMINKAHRE TVAVDLPSGL EADTGRCFGP
CIQATWTVTF ALPKLGLVVE PGASLTGRLE VADIGIPQKL VATQHFNRRL LTAAWCRSQL
PRREASGHKG LYGRVLAVGG SPGLTGAITL AATAALKAGA GLVTAAVPRG VQGILAMKTT
EIMTMSLPET PAGALSRDAL DPLLERLAEV DVLAIGPGLS RDPATVDLVK ELLPRVQVPA
VVDADALNAL ATDTRVLTGD HGPLVLTPHP GEMARLLGTT AAKIQEDRLE IAAKYAREWQ
AVLLLKGART VIAWPDGQVY INPTGNPGMA TAGSGDVLTG IIAGLAGQGL KPGVAAALGA
YLHGAAGDEA ARQRGQRAMM AGDLLDFLPY VLRNLEEEVE TIVAAGLGRD
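
Both sequence listings are printed in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch, assuming the listing is pasted in as plain text. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(listing: str) -> str:
    """Remove spaces and line breaks from a blocked listing; uppercase it."""
    return "".join(listing.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First two blocks and the last line of the gene sequence above.
head = clean_sequence("ATGTACCTGG TAACGGCAGC")
tail = clean_sequence("ACTATTGTCG CGGCCGGTTT GGGCCGAGAT TGA")
print(gc_fraction(head))     # GC fraction of the first 20 bases
print(ends_with_stop(tail))  # True
```

Running `gc_fraction` on the full cleaned gene sequence gives a value that can be compared against the GC content field of the record.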