Gene Information |
Locus tag | Acid345_3501 |
Symbol | |
ID | 4072760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4131033 |
End bp | 4132103 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637985524 |
Product | 2-nitropropane dioxygenase, NPD |
Protein accession | YP_592576 |
Protein GI | 94970528 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
|
|
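The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, although it reproduces the stated 1071 bp) and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information rows for Acid345_3501.
length = gene_length_bp(4131033, 4132103)
print(length, protein_length_aa(length))  # 1071 356
```

The strand ("-") does not affect the length calculation; it only determines which strand of the replicon the coding sequence is read from.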
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGAGCC GCCTGCTCGA CCTGCTCGAT GTGCAACATC CGATCTTTCT TGCGCCGCTC GGCGGCGGGC CGTCCACCCC CGAACTGGCT TCCACGATAG GCAATTCCGG CGGCCTGGGA GCCTTGGCAG CTGCCTATCT CACGCCAGAC CAGCTCATTT ATGACGTACA GAGTGCCCGT AAGCGGACGG ATGCGCCGCT GAACGTCAAT TTATTCGCCG GGGGCTACCA CGCTTCGACG CAGGACGATC CCCGCCCCAT GTTAGGGCTT CTGGAAGCCT CGCACCGCGA AGTAGGCCTG CCAGAGCCAA ACCTTCCTGT CGTTCCGCCA GACCCATTCG ACCAGCAATT TGAGGCTCTT CTATCTGCCA AACCGCGTGT TTTCAGCTTT ACCTTCGGAA TCCCGCTGGC GTCAGCCATC CAGCGCGCCC AAAAACGGGG AATCCTCGTG TTCGGTACGG CGACTACCGT TCGTGAGGGT CAGCTTCTTG CCTCTGCCGG TGTGGATGCC ATCGTCGCCC AGGGTGCCGA GGCCGGAGGC CAGCGCGGCA CCTTCGACGT CTCCTTCGAG GAAGGACTGG TCCCGCTTCG CGCATTGGTC GCCGGACTGG CAAACGCCGT CGCGCTGCCG GTAATCGCTT CTGGGGGCAT TATGAACGGC CGAGAAATCG CCGAGATGCT GCGCCTGGGC GCCAGCGCTG TGCAGCTTGG AACCGTGTTT CTGTGTACTC CCGAGGCTGG CACCTCTGCG CCCTACCGTA AAGCCCTTCT TGATGCCGAA GAGGACCGCA CCAGGATCAC GTATGCATTC ACCGGCCGCG GAGCGCGCGG GATCGAGAAC GCCTTTATGC GGCAAATGGC TGCACATCGC GATGCGATCC TGCCATTTCC CATGCAGAAC CTGCTCACGC GCGATCTTCG CAAAGCTGCG ACCCAGCAGG GCAAACCGGA ATACCTATCA TTGTGGGCTG GAACCGGGGT AGCGCAGATT CGCGCTGAAC CCGCCGCGCA GATCATGCGC CGCTTGGTGG ACGAGATGCA GGAAGCACTT GGTGGGCCCG GTAGGATTTG A
|
Protein sequence | MRSRLLDLLD VQHPIFLAPL GGGPSTPELA STIGNSGGLG ALAAAYLTPD QLIYDVQSAR KRTDAPLNVN LFAGGYHAST QDDPRPMLGL LEASHREVGL PEPNLPVVPP DPFDQQFEAL LSAKPRVFSF TFGIPLASAI QRAQKRGILV FGTATTVREG QLLASAGVDA IVAQGAEAGG QRGTFDVSFE EGLVPLRALV AGLANAVALP VIASGGIMNG REIAEMLRLG ASAVQLGTVF LCTPEAGTSA PYRKALLDAE EDRTRITYAF TGRGARGIEN AFMRQMAAHR DAILPFPMQN LLTRDLRKAA TQQGKPEYLS LWAGTGVAQI RAEPAAQIMR RLVDEMQEAL GGPGRI
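The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in plain Python. This is a minimal illustration, not the method the database used: it assumes the gene sequence is listed in coding orientation (it begins with GTG and ends with TGA), strips the spaces that separate the 10-base blocks, and translates with the amino acid assignments of translation table 11, in which an initiator codon such as GTG is read as M. All helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acid assignments for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiator codons permitted by table 11; at position 0 they are translated as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        aa = "M" if pos == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
prefix = "GTGCGGAGCC GCCTGCTCGA CCTGCTCGAT"
print(translate_cds(prefix))  # MRSRLLDLLD
```

Passing the full Gene sequence value to translate_cds and gc_fraction allows the result to be compared against the Protein sequence and GC content rows of this record.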