Gene Moth_2488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2488 
Symbol 
ID3831591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2593409 
End bp2594560 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content55% 
IMG OID637830410 
ProductPilT protein-like 
Protein accessionYP_431313 
Protein GI83591304 
COG category[R] General function prediction only 
COG ID[COG4956] Integral membrane protein (PIN domain superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000025338 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0444425 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTATTGC GCATTTTACG TGGGGCCTTT GGCCTGCTGG GAGCGGCAGC CGGCTTCTAT 
GTAGGTCGGG CAGGCTTAAA CCTCTGGCAG GTGGGCGGGG GAGTGAACCC GCCCCCGGGG
TTGAGTTGGA CGCTACTGGC CCTGGTGACT TTCGTGGCCG GCCTCGTGGG ATACGGTGTG
GCCCAACATA TTATTAACCT GGTGACCCAA TCCATGCGCT GGCTGGAAGG GAGGTTGCAA
CGCACCCCGG CCCAGGAGAT AATAAGCGGT GCCCTGGGAC TAATCTGTGG ACTTATAATT
GCTAATTTAT TGGGGGCCTC CTTCTTTCAT TTACCCCTGG TGGGACCCTA TATTCCCATG
GTTGGGAGTA TTCTCTTTGG CTACCTGGGC TGGAGCCTGG GTACCAAGCG CAGGGACGAA
GTCTGGTCCC TCTTTAATAT TTTCCCCCGC TGGGGGGGAA AAGAACGGGA TAAAGGCAAG
GGTGAAAGCG TCCGATCCGG GGCCAAGATC CTTGATACCA GCGTCATTAT CGACGGCCGG
ATCGCTGATA TTATTAAGAG TGGCTTTATA GAGGGAACGA TAGTTATTCC GGCCTTTGTC
CTGGAGGAGT TACGCCACAT AGCCGATTCT TCCGATCTGC TAAAACGCAA CCGCGGCCGG
CGCGGGCTTG ATATCCTGAA CAAAATCCGT AAGGAAACCG GCATTACCGT AAAGGTTTCC
GAGGTTGATT TTGACGATCT GACGGAGGTA GATAGCAAAC TTGTCCGCCT GGCCCAGAAG
ATGGGCGCTC CGGTCCTGAC CAATGATTAT AACCTGAACA AGGTGGCCGA GCTCCAGGGT
GTCCGGGTGT TGAACATCAA CGAACTGGCC AATGCCGTTA AGCCGGTAGT TTTACCGGGA
GAAGAAATGA CTGTTCAGGT GATTAAAGAC GGTAAGGAGA TGGGCCAGGG GGTAGCTTAC
TTAGATGACG GCACCATGAT TGTCGTTGAG AACGGCCGCC GGTTTATCGG CCAGACAATT
GCCGTCCTGG TAACCAGCGT TTTACAGACT GCCGCCGGGC GGATGATCTT TGCCCGGCCC
AAGGCTGCTG ACCGCAAACT TGGTGCCCAT CACCAGGCCC TGGAACGGAG CGAGTACCAG
TGCCTTTCCT GA
 
Protein sequence
MLLRILRGAF GLLGAAAGFY VGRAGLNLWQ VGGGVNPPPG LSWTLLALVT FVAGLVGYGV 
AQHIINLVTQ SMRWLEGRLQ RTPAQEIISG ALGLICGLII ANLLGASFFH LPLVGPYIPM
VGSILFGYLG WSLGTKRRDE VWSLFNIFPR WGGKERDKGK GESVRSGAKI LDTSVIIDGR
IADIIKSGFI EGTIVIPAFV LEELRHIADS SDLLKRNRGR RGLDILNKIR KETGITVKVS
EVDFDDLTEV DSKLVRLAQK MGAPVLTNDY NLNKVAELQG VRVLNINELA NAVKPVVLPG
EEMTVQVIKD GKEMGQGVAY LDDGTMIVVE NGRRFIGQTI AVLVTSVLQT AAGRMIFARP
KAADRKLGAH HQALERSEYQ CLS