Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0642 |
Symbol | |
ID | 4446890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 687442 |
End bp | 688470 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639688441 |
Product | dehydrogenase |
Protein accession | YP_830141 |
Protein GI | 116669208 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAATT CCCAGCAGCA ATCTGAACAC CGTCCCCAAG CCCGGGCCTA TTGGACTGTC GGCCACGAAA AAGGCGAGCT CCGCACGGAA GAGCTGCCCG CGCCGGGCCC GGGTGAGGCG CTGGTCCGCG CCCTGTATTC GGGAATCAGC AAAGGCACCG AAACCGTGGT CCACTGCGGC AAAGTACCGC CACGGGTTGC CGAACAAATG CGGGCGCCCC TGCAGGAGGG GTCCTTCCCG TCGCCGGTGA AGTTCGGTTA CCTCTCCGTG GGAATCGTGG AGGACGGCCC GGAGGGCTGG GTGGGCCGTA CCGTGTTCTG CCTGCACCCG CACCAGGACC GCTACATTGT TCCGGTCGAG TCCCTGACCG TGGTCCCGGA GAACGTCCCG GCCCGGCGGG CGGTTCTCAC CGGAACCGTC GAAACGGCCG TCAACGCCCT GTGGGAAGCC GGCCCACGGC TCGGTGACCG CGTCGCCGTC GTCGGCGCGG GCCTGGTGGG CGGCATGGTG GCCACCCTCC TTCGCACCTT CCCGCTGCAA CGGCTCCAGC TGGTGGATGT CGATCCGGCG AAGCGGGCGT TCGCCGATGC CCTCGGCGTC GAATTCGCCA ACCCCAACGA CGCACTGGCC GACTGCGACA TCGTTATCCA CTGTTCAGCT TCCCAGGAAG GGCTCGAACG CAGCCTCCAG CTGGTGGGCG ACGAAGGGGA CGTCATTGAA ATGTCCTGGT ACGCCGACCG CAAGGTCACC ATCCCGCTGG GGGAGGACTT CCACGCCCGC CGGCTCTCCA TCCGCGCAAG CCAGGTGGGA GTGGTGGCAC GCGCCCGCCG CCACCGCCGG ACCAACGCCG ACCGGCTGGC GCTCGCCGTG TCCCTGCTCA GCGATCCCGT CTACGACACG TTCCTCACCG GCGCATCGTC GTTTGCGGAA CTTCCCGCCG TCGTGCACGA GCTGGCCGAG GGCCGCCTGG ACGCCCTCTG CCACGTCATC GAATACCCTT CCGAACACCC CGCCGAAGAC AAGAGGTAG
|
Protein sequence | MINSQQQSEH RPQARAYWTV GHEKGELRTE ELPAPGPGEA LVRALYSGIS KGTETVVHCG KVPPRVAEQM RAPLQEGSFP SPVKFGYLSV GIVEDGPEGW VGRTVFCLHP HQDRYIVPVE SLTVVPENVP ARRAVLTGTV ETAVNALWEA GPRLGDRVAV VGAGLVGGMV ATLLRTFPLQ RLQLVDVDPA KRAFADALGV EFANPNDALA DCDIVIHCSA SQEGLERSLQ LVGDEGDVIE MSWYADRKVT IPLGEDFHAR RLSIRASQVG VVARARRHRR TNADRLALAV SLLSDPVYDT FLTGASSFAE LPAVVHELAE GRLDALCHVI EYPSEHPAED KR
|
| |