Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1957 |
Symbol | |
ID | 3832308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2035361 |
End bp | 2036686 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637829888 |
Product | putative chlorohydrolase/aminohydrolase |
Protein accession | YP_430798 |
Protein GI | 83590789 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR03314] putative selenium metabolism protein SsnA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.157555 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCTCA TCGGTAACGG CCGCCTACTT TCCCTGGTGC CGGGGAAACC CTACCAGGAA GACGGAGCGG TGGCCATTGA CGGCGACCTG ATCGTCGCCG TAGGGCCCAC GGCTGAACTT AAAGCGCGCT ACCCGGCAGC CGAGTGGCTG GACGCCCGGG GCATGGTCAT TATGCCCGGG ATGATCAATA CCCACATGCA CCTGTACAGC ACCTTTGCCC GGGGCATGGC CCTGAAGGAC CCGCCGCCAA CCAATTTCCT GGAGATCCTC CAGCGCCTGT GGTGGCGCCT GGACAAAGCC CTGACCCTGG AAGACGTTTA CTACAGCGCC CTCCTGCCCC TGATCGACTG CATCAAGAGC GGAACTACCA CCATTCTGGA TCACCACGCC AGCCCCTATG CCGTTACCGG CAGCCTGGAG ATGATCGCCC GGGCCGCCAT GGAGACGGGG GTACGCACCT GCCTGGCCTA CGAAGTCTCC GATCGCGATG GTAAGGAAAT TATGCGGCAG GGGATTGCGG AAAACATCGC CGCCATAAAG AAATACCGGG GCAAAGAGGG TTTAATTTCA GCTACCTTTG GCCTCCACGC CTCTCTGACC CTCTCCGACG CCACCCTGGA GGCCTGCCGG GAGGCCGAAG GGGAGGTGGG CAGCGGCTTT CACATCCACG TGGCCGAGGG GATCCAGGAC GTCGAGGACG CTTTAGCCAA ATCGGGGAAG CGGGTGGTGG AACGCCTGGC TGTTAACGGC ATCCTGGGAC CCAATACCAT CGCCGCCCAT TGCGTGCACG TGACGGACAG GGAAATAGCC ATTCTGAAGG AGACAGGCAC CCTGGTGGTC CACAACCCGG AATCAAATAT GGGCAATGCC GTCGGCTGTG CTCCGGTTGG CGACATGCTG GCCGCAGGTG TCCCCGTCGG CCTGGGAACG GACGGCTATA CCAGCGATAT GTTCGAGTCC CTAAAGACCG CCAACGTCCT GCGCAAATTC GTCTCCGGCG ATCCGGGCGC CGGCTGGGCA GAGGTCCCGG CCATGGCCTT TGAAAACAAC CGCCGCATCG CGAGCCGCTT CTTCCCCCAC CCCCTGGGCC GTCTGGAGCC AGGTGCCTAT GCCGATGTGA TCCTGGTGGA CTACCAGGCG CCAACACCCC TGGGAAGGGA CAACTGGTTC GGCCACCTCC TCTTCGGCTT CAACGGCGGC CTGGTGGATA CTACCGTTGT CGGCGGTAAA GTACTCATGC AAAGGCAGCG CTTGCTGCAC CTGGACGAGG CGGCCATCGC CGCCCGGGCC CGGGAACTGG CGATCAAAGT CTGGGAGCGG TTTTAA
|
Protein sequence | MLLIGNGRLL SLVPGKPYQE DGAVAIDGDL IVAVGPTAEL KARYPAAEWL DARGMVIMPG MINTHMHLYS TFARGMALKD PPPTNFLEIL QRLWWRLDKA LTLEDVYYSA LLPLIDCIKS GTTTILDHHA SPYAVTGSLE MIARAAMETG VRTCLAYEVS DRDGKEIMRQ GIAENIAAIK KYRGKEGLIS ATFGLHASLT LSDATLEACR EAEGEVGSGF HIHVAEGIQD VEDALAKSGK RVVERLAVNG ILGPNTIAAH CVHVTDREIA ILKETGTLVV HNPESNMGNA VGCAPVGDML AAGVPVGLGT DGYTSDMFES LKTANVLRKF VSGDPGAGWA EVPAMAFENN RRIASRFFPH PLGRLEPGAY ADVILVDYQA PTPLGRDNWF GHLLFGFNGG LVDTTVVGGK VLMQRQRLLH LDEAAIAARA RELAIKVWER F
|
| |