Gene Information |
Locus tag | Moth_1840 |
Symbol | |
ID | 3831700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1896719 |
End bp | 1897543 |
Gene Length | 825 bp |
Protein Length | 274 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637829771 |
Product | formamidopyrimidine-DNA glycosylase / DNA-(apurinic or apyrimidinic site) lyase |
Protein accession | YP_430683 |
Protein GI | 83590674 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
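The coordinate and length fields above can be cross-checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1896719, 1897543)
print(length_bp, protein_length(length_bp))  # prints: 825 274
```

Under these assumptions the result agrees with the Gene Length (825 bp) and Protein Length (274 aa) fields.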
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.649087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAAC TACCCGAAGT AGAAACCATT AAACGGACAC TGACTCCCTG CTTACGGGAG CAAAAAATCG CCAGGGTGGA GGTCTACCAC CCTGGCGTTA TTGCTGCCCC CGACCCGGAG ACCTTCAGCC GGTTACTGGC CGGCAGGATT ATTACCGGCC TGGACCGGCG GGGCAAGTAC CTACTGGTGC ATCTTTCAGG AGAATACTGT TTGGTGGTTC ACCTGCGCAT GACCGGGCGC CTGGTTTTTA CTGAAGGGGC AGCCCCCCTG GCTCCCCATA CCCACGTCGT TTTTAGCCTG GCTGGGGGAC CTTCTTTACG CTTTGTAGAC ACCCGCCGCT TCGGCCGTCT CTACCTGGCC GCGAAGGCGG AGGTAGAAAC GCTGCCCGGC TTGCGGGACC TGGGGCCGGA GCCCCTGGAC CCGGCCTTTG ACGCCCTGGC CCTGGCGGCT ATTCTAGCCG GCCGGCGCAG GCCAATTAAA CAGGTCCTCC TGGACCAGCG CCTGGTGGCC GGGATCGGCA ATATCTATGC CGATGAGATG CTTTTTGCCG CCGGCATCGA CCCCCGGCGC CCGGCGGCCT CCCTGAATCA TGAGGAGGTG GCTCGCCTGC GCGGGGCTAT GCAAAGGGTC CTGGAGCAGG GGATTGCCAA CCGTGGCACC TCAATCAGGG ATTATGTTGA CGGCAGCGGC CGCCAGGGGA GCAACCAGGA GCACCTCCAG GTTTATGGCC GGACAGGCCG ACCATGCCCC CGCTGCGGGC AACCCCTGGA GAGGGTGCGC CTCGGCGGCC GCAGCACCCA TTTTTGCCCC CGCTGCCAGG TATAA
Protein sequence | MPELPEVETI KRTLTPCLRE QKIARVEVYH PGVIAAPDPE TFSRLLAGRI ITGLDRRGKY LLVHLSGEYC LVVHLRMTGR LVFTEGAAPL APHTHVVFSL AGGPSLRFVD TRRFGRLYLA AKAEVETLPG LRDLGPEPLD PAFDALALAA ILAGRRRPIK QVLLDQRLVA GIGNIYADEM LFAAGIDPRR PAASLNHEEV ARLRGAMQRV LEQGIANRGT SIRDYVDGSG RQGSNQEHLQ VYGRTGRPCP RCGQPLERVR LGGRSTHFCP RCQV
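The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for stripping that layout and computing GC content is shown below. The helper names are hypothetical, and how the database rounds its reported GC content (64%) is not stated in the record, so exact agreement is not guaranteed.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage: paste the full Gene sequence field in place of this short excerpt.
excerpt = "ATGCCCGAAC TACCCGAAGT"
print(len(clean_sequence(excerpt)), gc_percent(excerpt))
```

Applying `clean_sequence` to the full Gene sequence field should also yield a string whose length matches the Gene Length field.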