Gene Information |
Locus tag | Arth_3036 |
Symbol | |
ID | 4444300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3402296 |
End bp | 3403216 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639690860 |
Product | taurine dioxygenase |
Protein accession | YP_832515 |
Protein GI | 116671582 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
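
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the final codon
    is a stop codon, which is not counted as an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Arth_3036).
length = gene_length(3402296, 3403216)
print(length, protein_length(length))  # prints: 921 306
```

Both results agree with the Gene Length and Protein Length fields listed in the record.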
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.163614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCA TCACCGAAAC CAAGCTCGAA TTCGCCAAAC TGGGTTCCCG CATCGGGGCT GAAATCCGTG GCCTGGACCT GGGCGGAGAC CTCTCCGCCG AGACCGTCGC ACAGATCCGG GCCGCGCTCA ACGAACACAA GGCGCTGGTG TTCCGCGAGG CCAACATCCT CACGGACGAG GCCCAGGTAA AGTTCGCCGG CCACTTCGGT CCGCTCACGA AGGCGCACCC CACCGTGGCC TCCGTGGAAG GCAAGGAAAG CGTCCTGCCG GTGGACAGCG AGAACGGCTC CGCCAACAAC TGGCACACGG ATGTCACGTT CGTGGTCAAC CCGCCGCAGG CCTCCACCCT GCGCAGCATC GACCTCCCCG CGTACGGCGG CGAAACGCTG ATCGCGTCCT CGGCCGGCGC CTACCGCGAC CTGCCCGAGG AGCTGCGGAA CTTCGCGGAC ACCCTCTGGG CCATCCACAC GAACGACTAC GACTACTCGG TGCCGAAGAA CCTGGAGCAC GAAAACGCTG AGGAGCGCCG GAAGGAGTTC ACCCGGCTGA AGTTCGAGAC GGCCCACCCG GTGGTCCGGG TCCACCCGCT GACCGGCGAG CGCGGATTGT TCATTGGCGG CTTCGCGCAG CGGCTGCGGA TCGTGGGGCT GTCCAACACG GAGTCGAAGG ACATCATCCG GCTGCTGCAG GCCTACGTCA CGCGTCCGGA GAACGTGGTG CGGGTGAACT GGGAGCCGAA CCAGCTGGTG CTCTTCGACA ACCGCATCAC CCAGCACTAC GCCCCGGACA ACTATGACGG CCAGCCGCGC AAGCTCAACC GCGTGACCAT TGCCGGCGAC ATCCCCGTGG GCATCGACGG CAAGCCGAGC CAGGCCCTGC AGGGCGACTC CTCCACCTAC TCGGTGGTGG CGCCGCTCTA G
|
Protein sequence | MTTITETKLE FAKLGSRIGA EIRGLDLGGD LSAETVAQIR AALNEHKALV FREANILTDE AQVKFAGHFG PLTKAHPTVA SVEGKESVLP VDSENGSANN WHTDVTFVVN PPQASTLRSI DLPAYGGETL IASSAGAYRD LPEELRNFAD TLWAIHTNDY DYSVPKNLEH ENAEERRKEF TRLKFETAHP VVRVHPLTGE RGLFIGGFAQ RLRIVGLSNT ESKDIIRLLQ AYVTRPENVV RVNWEPNQLV LFDNRITQHY APDNYDGQPR KLNRVTIAGD IPVGIDGKPS QALQGDSSTY SVVAPL
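
The relationship between the two sequences above can be sketched in a few lines of Python. This is a hedged illustration, not the pipeline that produced the record: it assumes the sequence is pasted in as shown (blocks of 10 separated by spaces), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. The names `CODON_TABLE`, `clean`, `translate`, and `gc_content` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon table built from the NCBI ordering (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGACCACCA TCACCGAAAC CAAGCTCGAA"))  # prints: MTTITETKLE
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed in the record, and `gc_content` gives a value that can be compared with the GC content field.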