Gene Moth_1957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1957 
Symbol 
ID3832308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2035361 
End bp2036686 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content62% 
IMG OID637829888 
Productputative chlorohydrolase/aminohydrolase 
Protein accessionYP_430798 
Protein GI83590789 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR03314] putative selenium metabolism protein SsnA 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.157555 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCTCA TCGGTAACGG CCGCCTACTT TCCCTGGTGC CGGGGAAACC CTACCAGGAA 
GACGGAGCGG TGGCCATTGA CGGCGACCTG ATCGTCGCCG TAGGGCCCAC GGCTGAACTT
AAAGCGCGCT ACCCGGCAGC CGAGTGGCTG GACGCCCGGG GCATGGTCAT TATGCCCGGG
ATGATCAATA CCCACATGCA CCTGTACAGC ACCTTTGCCC GGGGCATGGC CCTGAAGGAC
CCGCCGCCAA CCAATTTCCT GGAGATCCTC CAGCGCCTGT GGTGGCGCCT GGACAAAGCC
CTGACCCTGG AAGACGTTTA CTACAGCGCC CTCCTGCCCC TGATCGACTG CATCAAGAGC
GGAACTACCA CCATTCTGGA TCACCACGCC AGCCCCTATG CCGTTACCGG CAGCCTGGAG
ATGATCGCCC GGGCCGCCAT GGAGACGGGG GTACGCACCT GCCTGGCCTA CGAAGTCTCC
GATCGCGATG GTAAGGAAAT TATGCGGCAG GGGATTGCGG AAAACATCGC CGCCATAAAG
AAATACCGGG GCAAAGAGGG TTTAATTTCA GCTACCTTTG GCCTCCACGC CTCTCTGACC
CTCTCCGACG CCACCCTGGA GGCCTGCCGG GAGGCCGAAG GGGAGGTGGG CAGCGGCTTT
CACATCCACG TGGCCGAGGG GATCCAGGAC GTCGAGGACG CTTTAGCCAA ATCGGGGAAG
CGGGTGGTGG AACGCCTGGC TGTTAACGGC ATCCTGGGAC CCAATACCAT CGCCGCCCAT
TGCGTGCACG TGACGGACAG GGAAATAGCC ATTCTGAAGG AGACAGGCAC CCTGGTGGTC
CACAACCCGG AATCAAATAT GGGCAATGCC GTCGGCTGTG CTCCGGTTGG CGACATGCTG
GCCGCAGGTG TCCCCGTCGG CCTGGGAACG GACGGCTATA CCAGCGATAT GTTCGAGTCC
CTAAAGACCG CCAACGTCCT GCGCAAATTC GTCTCCGGCG ATCCGGGCGC CGGCTGGGCA
GAGGTCCCGG CCATGGCCTT TGAAAACAAC CGCCGCATCG CGAGCCGCTT CTTCCCCCAC
CCCCTGGGCC GTCTGGAGCC AGGTGCCTAT GCCGATGTGA TCCTGGTGGA CTACCAGGCG
CCAACACCCC TGGGAAGGGA CAACTGGTTC GGCCACCTCC TCTTCGGCTT CAACGGCGGC
CTGGTGGATA CTACCGTTGT CGGCGGTAAA GTACTCATGC AAAGGCAGCG CTTGCTGCAC
CTGGACGAGG CGGCCATCGC CGCCCGGGCC CGGGAACTGG CGATCAAAGT CTGGGAGCGG
TTTTAA
 
Protein sequence
MLLIGNGRLL SLVPGKPYQE DGAVAIDGDL IVAVGPTAEL KARYPAAEWL DARGMVIMPG 
MINTHMHLYS TFARGMALKD PPPTNFLEIL QRLWWRLDKA LTLEDVYYSA LLPLIDCIKS
GTTTILDHHA SPYAVTGSLE MIARAAMETG VRTCLAYEVS DRDGKEIMRQ GIAENIAAIK
KYRGKEGLIS ATFGLHASLT LSDATLEACR EAEGEVGSGF HIHVAEGIQD VEDALAKSGK
RVVERLAVNG ILGPNTIAAH CVHVTDREIA ILKETGTLVV HNPESNMGNA VGCAPVGDML
AAGVPVGLGT DGYTSDMFES LKTANVLRKF VSGDPGAGWA EVPAMAFENN RRIASRFFPH
PLGRLEPGAY ADVILVDYQA PTPLGRDNWF GHLLFGFNGG LVDTTVVGGK VLMQRQRLLH
LDEAAIAARA RELAIKVWER F