Gene Moth_0427 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0427 
Symbol 
ID3830951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp430377 
End bp431627 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content38% 
IMG OID637828362 
Producthypothetical protein 
Protein accessionYP_429301 
Protein GI83589292 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0723207 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGTGA TCACACAAGA AAATTTAGCA TTGGTAGTGA CCAAGGAAGT AGAGCAGGTT 
CGTGTAACAG ATATCCATAC CCATCTTTAT CCACCCAATT TTGGAGATCT ATCATTATAT
GGAATAGACG AGCTGTTAAC CTATCATTAT CTTGTTGCCG AGTTTTTTAG ATATTCTACC
ATGGACTATG AAGATTTTTT TAATCTATCC AAAACACAGC AGGCTGAACT AATCTTCCAG
ACATTATTTT TAGAGCACTC ACCTGTGAGT GAGGCTCAGC GCGGTGTTTT AACGACGTTG
AAAGAACTGG GGATGGATTT AAATGTAAGG GATTTACGTG TCTTCAGGGA ACAAATCAAC
TCGATTCCGG CATTTGATTA TGTTGATAGG ATTTTTGCGA TAGCTGGAAT AAAAGAAGTT
GTTATGACTA ATGATCCCTT TGATCCCAAA GAAAGACAAT TATGGGAAAC GAAAGGCAAT
AAAGATCCGA GATTTAAAGC TGCCCTAAGA CTTGATGTAC TTTTGAATAA CTATGAAAAA
AACTATGAGT ACTTAAATCA AATGGGTTTT TTGGTTGATA AGAAGCTGGA TGAGAATACA
TTAACTGAAA TAAGACGTTT TCTTCGTTAT TGGATTGAAA AAACCAATGC CATCTATTTA
GCGGTCTCAT TACCACCAGA TTTTATGGTT CCTGAAGATT CCTGCCGCTC GAGAATACTA
GAAAAATGCG TCCTCCCAAT TTGCAGGGAA TTAAATATTC CCCTGGCTTT AATGATTGGG
GTGAGGAGAT CAATAAACCC CAGGTTAGGC CTGGCGGCGG ATTCTTTAGG AAAAGCTGAT
ATAAGGGCAA TCGAATACTT GTGCAGGACT TATCCTGAAA ATAAATTTTT AGTAACCATG
CTATCTAGGG AAAATCAACA TGAACTTGTA GTAACAGCAA GGAAATTTAG GAATTTAATG
GTATTTGGTT GCTGGTGGTT TTTAAATAAT CCCATGATAG TTGAAGAGAT TACAAATATG
CGATTGGAAA ATTTGGGTTT GTCATTTATT CCCCAGCACT CAGATGCTCG CGTCCTGGAA
CATCTCATCT ATAAATGGGT ACATGCCAGG AAGATAATAG CCGATGTTCT GACTAAAAAG
TACCTAGATC TTTTAGAAAG CGGTTGGAGA GTAACAGAAG AAGAAATTAA GAGGGATATA
GAGGATTTGT TTGGCAATAA TTTCTGGAAG TTTGTCGGGC GAAATGTTTA A
 
Protein sequence
MPVITQENLA LVVTKEVEQV RVTDIHTHLY PPNFGDLSLY GIDELLTYHY LVAEFFRYST 
MDYEDFFNLS KTQQAELIFQ TLFLEHSPVS EAQRGVLTTL KELGMDLNVR DLRVFREQIN
SIPAFDYVDR IFAIAGIKEV VMTNDPFDPK ERQLWETKGN KDPRFKAALR LDVLLNNYEK
NYEYLNQMGF LVDKKLDENT LTEIRRFLRY WIEKTNAIYL AVSLPPDFMV PEDSCRSRIL
EKCVLPICRE LNIPLALMIG VRRSINPRLG LAADSLGKAD IRAIEYLCRT YPENKFLVTM
LSRENQHELV VTARKFRNLM VFGCWWFLNN PMIVEEITNM RLENLGLSFI PQHSDARVLE
HLIYKWVHAR KIIADVLTKK YLDLLESGWR VTEEEIKRDI EDLFGNNFWK FVGRNV