Gene Arth_1923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1923 
Symbol 
ID4445542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2165218 
End bp2166870 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content63% 
IMG OID639689733 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_831405 
Protein GI116670472 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAGGA AGAAGCAGCC AGAGCCGGTA GATGTCGTCA TCGTCGGTGC GGGCGCCGGC 
GGGGCAACCG CGGCAAAGGT GCTGTCGGAG GCCGGCCTGA AGGTGGTAGG CCTCGAGCGT
GGGCCCTGGC TAAAGCCCGA GCACGCCTCG GGGGACGAAC TCAAGTTCCT CAACCGGAAT
TTCATCTGGC AGGACCCGAA GATCAAACCG CGGACCTACC GCCCAAACGA CAAGGTAGAG
GCGGAAATCA CCAACTTTTC CGCCACGCCG CAGGTCGTCG GCGGCGGCAC CACCCACTGG
GGTGGCATGG TTCCGCGTAT GGCGGAGAGC GATTTCAAGC TGCGCAGCCT GCACGGCGAT
GTTCCGGGCG CAAGCCTGGT TGACTGGCCC ATCTCCTATG ACGAACTGGA GCCGTACTAC
ACACGCGTTG AGTGGGAGTT CGGCACGTCG GGCCTTGCCG GAGCGAACAA GTGGGAAGCG
TGGCGCAGCC GCGGCTACCC CACCAAGCCA TCACCGCTGA GCCAGGTTGG ACGGACCTTC
GCCACCGCGA TGTCAAAGCT CGGGCATGGC ACGTTCCCCA TGCCGCAGGG CATGGTCACC
GAACCCTATC GGGGACGTCA GCCATTCAGT GAGAACGGCT TCTGGCAGCA GTACCCGGAC
CCGGGAACAG GCAAGTCATC CACGCTGATC AGCTTCATTC CGGACGCGGT TGCGACCGGG
CGCTACGACC TCCGCTCCGA TTCCTACGTG AGCGAGATAC TCGTAGGCAA GGACGGCCGC
GCCACCGGTG TCCGTTACCA GGACGAGGAC GGCGACGAGT TCGTCCAGCA CGCGAAAGCC
GTGATCGTCT GTGGCGGCGG CATCGAAACC CCACGCCTGC TGCTCATGTC GAAGTCAGGG
CTCTTCCCGG ACGGACTCGG CAACGGCAGC GGCATGGTGG GTAAGAACGC CACCTTCCAT
CAGTACTCAT TCTCGGTCGG CTTGTTCGAC CGCGAGGTCA GCGACCCGCT TTACGGGTGG
GCGGGCCACT ACATGAGCCT GTGTTCGTTC GACTTCTACG AGACGGACGA GAGCCGGGGC
CACATCCTGG GATCACTGAT TTTCCCGTCA ATGATCGGCC ACCCGGTGAA CTGGAGCTTC
CCCGGCCGGC CTACGTGGGG CCAGGCGGCC AAGGACGCCG ACCGCGATTT TTTCAACCAC
AGCATGAAAA TCGGTGTCCT CCTGCACGAC CTGCCGGTGG AGGACAACCG TGTCGACCTT
GACCCGAACG TCAAAGACGC ATGGGGTCTT CCGGTGGCCC GAATTACCCA CACGCCCCAC
TCCAACGACT TTGCCCAGGA ACGCTGGCAG GTTGCCAAGA ACGGGGAAAT CCTCGAGGCT
GCCGGGGCGA AGAAGGTCAT TCCGGTCAAT ATGGAACGGA TCACGGGCAA CACCTCGCAT
GAACTGGGCA CCGCCCGGAT GGGCAATGAT CCGGCGACGT CCGTCGTGGA CCGCTGGTGC
CGCTCGCATG AAGTACCAAA CCTCTACGTT TTCGACGCGA GCTTCTTCCC GACGGCGACC
GGCATCAACC CGGCCCTGAC GATCATGGCC AACGCCTGGA GGTGCTCTGA CCATATCCTC
CAGGTGGACC GCCACGGCTG GTCCGACAAC TGA
 
Protein sequence
MTRKKQPEPV DVVIVGAGAG GATAAKVLSE AGLKVVGLER GPWLKPEHAS GDELKFLNRN 
FIWQDPKIKP RTYRPNDKVE AEITNFSATP QVVGGGTTHW GGMVPRMAES DFKLRSLHGD
VPGASLVDWP ISYDELEPYY TRVEWEFGTS GLAGANKWEA WRSRGYPTKP SPLSQVGRTF
ATAMSKLGHG TFPMPQGMVT EPYRGRQPFS ENGFWQQYPD PGTGKSSTLI SFIPDAVATG
RYDLRSDSYV SEILVGKDGR ATGVRYQDED GDEFVQHAKA VIVCGGGIET PRLLLMSKSG
LFPDGLGNGS GMVGKNATFH QYSFSVGLFD REVSDPLYGW AGHYMSLCSF DFYETDESRG
HILGSLIFPS MIGHPVNWSF PGRPTWGQAA KDADRDFFNH SMKIGVLLHD LPVEDNRVDL
DPNVKDAWGL PVARITHTPH SNDFAQERWQ VAKNGEILEA AGAKKVIPVN MERITGNTSH
ELGTARMGND PATSVVDRWC RSHEVPNLYV FDASFFPTAT GINPALTIMA NAWRCSDHIL
QVDRHGWSDN