Gene Moth_1143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1143 
Symbol 
ID3833242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1175268 
End bp1176812 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content56% 
IMG OID637829074 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_430000 
Protein GI83589991 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID[TIGR02900] stage V sporulation protein B 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGGG TATCCATCTG GCAAGGGACA GTTATCCTGA TGGTGGCCAG CTTATTGAAC 
CGCATCCTGA GCTTTGGCTA CCGAATGCTG GTTGTGCGCT ACATAGGGGC CGAAGGTATG
GGCCTCTATG AAATGGTTTT CCCTTTTTAT AGCCTGGTGC TTATGGTCAC TACCGCCGGC
ATACCCGTTG CCCTGGCTAA ACTAACAGCC GAAAGGATAG CCCTGGCCCG CTGGGGACAG
GTACGCAGCG TCTTTCGCCT ATCCCTTATT TTTTTGACCC TTAGCGGCTT ACTGGCCGCC
TTGGTGTTGT GGCGGCTGGC TCCCTTTTTA ACCGGCAGGA TGTTTGCCGA TACCCGGGTT
TACCAGGCCT TTGTGGTGAT GATCCTGGCC CTGCCGGTAG TCTGTATTTG TTCGGCCTTT
CGGGGCTACT TCCAGGGCTG GCAGTTGATG CGACCGGTGG CTCTGGCCCA GGTTGTTGAA
CAGGTTGTGC GGGTGAGTGC CGGGTTTTTC CTGGGCATTT ATTTGCTCCC CTACGGCGTG
GCCATGGCTG CGGCCGGCCT GGCGGCAGGT ATGGTTTTGG GAGAACTGGC AGGCCTGGGG
ATTAGCGTCT TTATCTTTAA CCTGGCCCGG CCGTATTACG ATATCGCTGC CGACCAGACA
GGCTCGTTGA AAGCAGATAT TCTTCCCCTG GTCCGTTTGG CCATCCCGGT GATGCTGGCC
CGTATGGCCG GAGGTATTAT GTTAACTATT GAAGCCTTGT TAATCCCTCG CCAGCTCCAG
GCCTGGGGGG TGACCATGCG GGAGGCCACA ACCATTTACG GCCAGTATGC AGGCATTGCC
TTTACTTTGA TATACCTGCC CATGGTTATC ACTGTGGCCC TGGCCATGAC CATGGTGCCC
GCCATTTCTG AGGCTAGGGC AGTTGGGGAT TGCGATTTAT TAAATAAACG CTGCCGGCAG
TCTTTAAAGA TGACTATTTA TAGCAGCCTG CCCTTTGCCA TAACCTTTTA CCTCTTTGCC
GCCCCCATTT GCGGCCTTAT CTTCGCCACT CCTGAGGCCG GTATACCCCT GAAAATCCTC
GCCTGGGGGA GTATCTTTAT CTACCTGGAG CAGACTACCG TGGGCATCCT GAATGGCCTG
GGGTCCATGT CTACCATCCT CTGGACGACT GTCGCCGGAG GTATTGTCGA CCTCCTGGGG
ATCTATTACC TGACGCCGGT GCTGGGTATT GCCGGCGCCG CCCTTGGGGT AAACCTGGGA
ACTGCCGTTA CCGCCATCCT GAACCTTCTG GCCCTGATCC GGCAGACCGG TTTTCACCTG
GACTTCCGGA CCTTTGTTTA CTGGCCAGCC GTAGCCGGGG CGGGAATGTT CCTTGGGGCT
TCCCTTCTCT GGCGGCTGCT GGTAGCTACA CCGGAACCAT GGCGCCTGTT CCAGACCCTG
GCCGGTAGCA GTTTGTTTTA CCTGCTGATT CTCCTGGTAG CGGGAGAAAT ATCCCCCGGG
CACTTTTACC TATTCCCCTG GCCGGGCCAG AGGAACGACA AATAA
 
Protein sequence
MAGVSIWQGT VILMVASLLN RILSFGYRML VVRYIGAEGM GLYEMVFPFY SLVLMVTTAG 
IPVALAKLTA ERIALARWGQ VRSVFRLSLI FLTLSGLLAA LVLWRLAPFL TGRMFADTRV
YQAFVVMILA LPVVCICSAF RGYFQGWQLM RPVALAQVVE QVVRVSAGFF LGIYLLPYGV
AMAAAGLAAG MVLGELAGLG ISVFIFNLAR PYYDIAADQT GSLKADILPL VRLAIPVMLA
RMAGGIMLTI EALLIPRQLQ AWGVTMREAT TIYGQYAGIA FTLIYLPMVI TVALAMTMVP
AISEARAVGD CDLLNKRCRQ SLKMTIYSSL PFAITFYLFA APICGLIFAT PEAGIPLKIL
AWGSIFIYLE QTTVGILNGL GSMSTILWTT VAGGIVDLLG IYYLTPVLGI AGAALGVNLG
TAVTAILNLL ALIRQTGFHL DFRTFVYWPA VAGAGMFLGA SLLWRLLVAT PEPWRLFQTL
AGSSLFYLLI LLVAGEISPG HFYLFPWPGQ RNDK