Gene Moth_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1040 
Symbol 
ID3831846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1067225 
End bp1068325 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content52% 
IMG OID637828968 
Producthypothetical protein 
Protein accessionYP_429897 
Protein GI83589888 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID[TIGR02872] sporulation integral membrane protein YtvI 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00428171 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000129007 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
TTGGCCGGCC TGAACGGGCG CTTTCAACAG ACTTTCCAGT CTCTATTACT AGCCTTAATG 
GCTGCCGTCC TGTTCCTGTT GCTTTATTAT TACATATTTC CTGCGGCCAG GGAGATTATC
AAGACCCTGG TCCCTATCGT CTTGCCCTTT GCCCTGGCAG CTTTACTTGC CGCCATTATC
GATCCTGCAG TCAACCTGCT TGAAAAAAAG TTAAAAATAG GACGTGGCTG GGCTGTCATC
ACCACTCTGC TCCTGGTACT GGCCATTATG GGCGTAGCCC TGTTTTACCT GCTCGCCAAT
CTAATTATCG AGCTGGAAAG CCTGGTCCTG AACCTGCCAG CCCAGGCCCG CAGCCTGGGA
GCGCTTCTCC AGGAGTATTT TTACCGCCTG CAGGGCTTTT ATTTCGGCGG CAACCTGCCG
CCGAACATCT TGATTTCCTT TCAATCTTTA TTCAATAATG CCGTTAACGT TTTAAAAGGT
TTCCTTACCC AAACAGTTCA GGGACTGGTT ATTATTGTCA GCTCCCTGCC CGACTTCTTT
ATTTTTGTTA TCATTACCCT GGTGGCCACC TATTTCTTCA GCCGGGATAA GGAACTAATC
CTGCGGACCT TGCTCCGGGT TATGCCGGCC GGGTGGCGGG AACGGACGAG CCGGGTTTTC
AGTTCCCTGG GCCAGGCGAT TATCGGTTAC CTGCGGGCAG AGATCCTGTT AATCAGCCTG
CAGATGACCC AGAGCGTCCT CGGTCTCCTG ATTTTAAAGG TGGACTACGC CCTGACCCTG
GCCTTTTTGA TCGGCCTGGC CGACTTACTC CCCATTGTAG GGCCGGGTAC GGTCTTTATC
CCCTGGATCA TTATTGAATT TATCCTCGGC CACTACGGCC TGGGGCTGGC CCTGCTGATT
CTCTACGCCT TTATTATCAT CCTGCGCCAG GTACTCCAGC CCAAGCTGGT GGCTGTCAAC
CTGGGCCTGT ACCCTTTAAC CACTTTGATT GTCCTTTATG CCGGCTTAAA GCTCCTGGGA
GTAGTGGGCC TGGCCTTGGG GCCTCTGACC ATTGTTGTTT TAAAGGCCTT TTTCCGTTCC
GGACAGGAGG TTAATAAGTA A
 
Protein sequence
MAGLNGRFQQ TFQSLLLALM AAVLFLLLYY YIFPAAREII KTLVPIVLPF ALAALLAAII 
DPAVNLLEKK LKIGRGWAVI TTLLLVLAIM GVALFYLLAN LIIELESLVL NLPAQARSLG
ALLQEYFYRL QGFYFGGNLP PNILISFQSL FNNAVNVLKG FLTQTVQGLV IIVSSLPDFF
IFVIITLVAT YFFSRDKELI LRTLLRVMPA GWRERTSRVF SSLGQAIIGY LRAEILLISL
QMTQSVLGLL ILKVDYALTL AFLIGLADLL PIVGPGTVFI PWIIIEFILG HYGLGLALLI
LYAFIIILRQ VLQPKLVAVN LGLYPLTTLI VLYAGLKLLG VVGLALGPLT IVVLKAFFRS
GQEVNK