Gene Moth_2269 details

Gene Information

Locus tag: Moth_2269
Symbol:
ID: 3831380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2376428
End bp: 2377768
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 59%
IMG OID: 637830189
Product: hypothetical protein
Protein accession: YP_431099
Protein GI: 83591090
COG category: [S] Function unknown
COG ID: [COG5441] Uncharacterized conserved protein
TIGRFAM ID:
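The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both are assumptions about the record, not statements from it. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2376428
end_bp = 2377768
gene_length_bp = 1341
protein_length_aa = 446


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS: all codons minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1341
print(expected_protein_length(gene_length_bp))  # 446
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.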


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00139648
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACAGCGA AGGACATTGC CATTATCGCC ACAGTAGACA CCAAAGAGGC CGAAGCCCGT 
TTTCTGCAGG AGTTTATCAC CAGTCATGGC TGGCAGGCCC CGGTGCTGGA TGTCAGCACT
CATCGTCCCC ATAATTTTCA GGCGACCTAT TCCAGGGAAG AGATCTGCCG CCGGGCTGGG
GTGGAGTACA AGGATTTGGG CACCTTGCGC CGGGATGCCA TGATGGCAAC CATGGGCCGC
GGAGCGGCCC GGGTATTAAT GGAACTTTAT GACCGGGGAG AGCTGGCGGG CGTCCTGGGC
ATCGGCGGCA ACCAGGGTAC GGCCATAGCA GCTATGGCCA TGCGCTCTTT GCCTGTCGGG
CTGCCCAAGT TAATAGTTTC TACGGTGGCC TCGGGCAATG TCCGGCCCTA TGTAGAGTAC
AAGGACATTA CCATGATGTT CTCAGTAGCC GACCTGCTGG GTGGTCCCAA CACCGTCAGC
CGCACTATCC TCAGCAATGC TGCCGGGGCG GTGATAGGAA TGGCCGCCTG GGGCCAGCCC
CTGAAGGCGG GGGAACGGCC GGTAATTGCC ATCACAGCTC TGGGCAATAC CGACCCGGCA
GTAGCAGCCG CCCGGGGGCG ACTGGTGGAA CTGGGTTACG AAGTGATAGC CTTTCATGCT
TCCGGGACCT GCGGATCAGC CATGGAAGAA CTAATCGAAG CAGGGTTAAT AAACGGCGTT
CTGGATCTGA CCCCCCACGA GTTGATCGGC GAGGTCCATG GCGCTGATAT TTATACTCCC
CTGCGGCCGC GCCTGGAAGC TGCAGGCAGG CGGGGGATTC CCCAAGTTGT TTCCTTGGGC
GGCCTGGATT ACTTCTGTTT TGGACCGGCA GATACCATAC CGCAGCGTTT CCAAGGCCGG
AAGACCCACT ACCATAACCC CTACAATACC AATGTCCGGG CTACCGGGGG TGAACTGGCC
CAGGTAGGCG AAGTCATGGC CGCCAAGCTA AATGCCGCTC GCGGTCCGGT GGTGGTGATG
GTCCCTCTCA AGGGCTGGTC GGAAAACGGC CGGGCCGGTG GCCCCCTGTA CGATCAGGAA
GCCGACGCCG CTCTGGTGGC GTCCCTGGAG GCCAACCTGA ATCCCGGGAT AAAACTTATG
AAACTCAACG CCCATATTAA CGACCCGATC TTCGCCGCCA GCGCCGTCGC TGTTTTGCAC
CAGTTGATGG AGGTTTCTCG GCCGGTAGAT GGCACCTTTC CCAGGGAAGC CGTGGAGAAG
GGCACACTCC CTCCAAAAAA CCCGAAATGG AGGCGATCGT TAACGCCAGA AAGCGCAATC
GTAAAGCAAG CACCAAGGTG A
 
Protein sequence
MTAKDIAIIA TVDTKEAEAR FLQEFITSHG WQAPVLDVST HRPHNFQATY SREEICRRAG 
VEYKDLGTLR RDAMMATMGR GAARVLMELY DRGELAGVLG IGGNQGTAIA AMAMRSLPVG
LPKLIVSTVA SGNVRPYVEY KDITMMFSVA DLLGGPNTVS RTILSNAAGA VIGMAAWGQP
LKAGERPVIA ITALGNTDPA VAAARGRLVE LGYEVIAFHA SGTCGSAMEE LIEAGLINGV
LDLTPHELIG EVHGADIYTP LRPRLEAAGR RGIPQVVSLG GLDYFCFGPA DTIPQRFQGR
KTHYHNPYNT NVRATGGELA QVGEVMAAKL NAARGPVVVM VPLKGWSENG RAGGPLYDQE
ADAALVASLE ANLNPGIKLM KLNAHINDPI FAASAVAVLH QLMEVSRPVD GTFPREAVEK
GTLPPKNPKW RRSLTPESAI VKQAPR
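
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of how the 10-base blocks shown above might be cleaned and translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the start-codon set is deliberately limited to three common bacterial starts as a simplifying assumption.

```python
# Codon table in NCBI order (T, C, A, G); table 11 assigns the same
# amino acids as the standard code and differs only in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Assumption: only these three common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate from the first base up to (not including) the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    # An initiation codon is read as methionine regardless of its usual meaning.
    if residues and s[:3] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First line of the gene sequence above.
first_line = "GTGACAGCGA AGGACATTGC CATTATCGCC ACAGTAGACA CCAAAGAGGC CGAAGCCCGT"
print(translate(first_line))  # MTAKDIAIIATVDTKEAEAR
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.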