Gene Moth_1876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1876 
Symbol 
ID3831220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1938775 
End bp1939932 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content62% 
IMG OID637829808 
Producthypothetical protein 
Protein accessionYP_430719 
Protein GI83590710 
COG category[S] Function unknown 
COG ID[COG3949] Uncharacterized membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAC AGTATTCTAC CTGGCAGATT GCCGCTACTT ATATCGGCAC CGTGGTGGGG 
GCCGGGTTCG CTTCCGGCCA GGAAGTGTTG CAGTTCTTCG GGTACTTCGG GCTGCGGGGC
ATCCTGGGCC TGATCCTGGC CACGGCTCTA TTTATCTTTT TTGGCTACAC CGTCCTTAGG
CTGGGCTTTC AATTAAAGGC GGAGTCCCAC CTGGAGGTGA TGCACCGGGC CGGCGGCGCC
TTCATCGGCC GGGCGGTAGA TGCCGTCACC ACCTTCTTCC TATTTGGCGC CCTGGCCGTC
ATGGCTGCCG GTTCGGGGGC CATTTTCAGG CAGGAATTCC ATCTGCCCGT GCTCCTGGGC
AGCAGCCTGC TGATAGCCAT CACCCTGGTA ACTGTCCTGG CGGGCATTGA GAAGGTGATT
GACTCCATCA GTTTGGTAGC CCCGGTCTTG ATAGCCTCTG TACTTGGCAT CAGCCTGGCC
ACGGTGGCTA AAAACCTGCC CGCCCTGGTA GCCAACCTTT CCTGGGAGGA GACTTACCGG
GCCGCCGTAT CTTCCTGGCC CCTGGCGGCC CTCCTCTATG CTTCTTACAA CCTGGTATTA
TCCATTGCTG TCCTGGGCCC CCTGGGAGCC CTGGCCCGGC AGGAGCGCCT CTTGCCGGGG
GCCTTCCTGG GGGGCCTGGG CCTGGGGCTG GGAGCCATAG CCATTACCCT GGCCCTGATC
ACCACGGCCC CGGCAGTAAC GGCCCTGGAA GTGCCCATGC TGTATATAGC CGGCAGTTTC
AGCCCCGTCC TGCGCATTTT TTACAGCGCC GTCCTGCTGG CGGAGATCTA TACTACTGCT
GTCAGCAGCC TCTACGGTTT TGCCGCCCGC CTGGCCGGAC CGGGAGGAAA TAACTTTCGC
CGGCTGGCTA TAGGAGCCAG TGCCGTGGCC CTGGCAGCCG GCCAGGCCGG CTTTTCCCGC
CTGGTGGCCA CCCTTTTCCC CCTGGTGGGT TACGCCGGTT TCCTGCTCCT CGGAGGCCTG
GCCTATTACG TTCTAAAAGA AATCCTGGCT CTACGACCGG CATTTCCAGG TCGCCTGGTC
CCTGCCCCGG CCCGCAGGCC GATTTTGGGG GCGGTTTTAG AGAGAAGGGG AAAGGCGGGC
GAGAAGGAAC GCCCTTAG
 
Protein sequence
MAKQYSTWQI AATYIGTVVG AGFASGQEVL QFFGYFGLRG ILGLILATAL FIFFGYTVLR 
LGFQLKAESH LEVMHRAGGA FIGRAVDAVT TFFLFGALAV MAAGSGAIFR QEFHLPVLLG
SSLLIAITLV TVLAGIEKVI DSISLVAPVL IASVLGISLA TVAKNLPALV ANLSWEETYR
AAVSSWPLAA LLYASYNLVL SIAVLGPLGA LARQERLLPG AFLGGLGLGL GAIAITLALI
TTAPAVTALE VPMLYIAGSF SPVLRIFYSA VLLAEIYTTA VSSLYGFAAR LAGPGGNNFR
RLAIGASAVA LAAGQAGFSR LVATLFPLVG YAGFLLLGGL AYYVLKEILA LRPAFPGRLV
PAPARRPILG AVLERRGKAG EKERP