Gene Arth_3726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3726 
Symbol 
ID4443727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4196599 
End bp4198302 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content67% 
IMG OID639691550 
Productcholine dehydrogenase 
Protein accessionYP_833201 
Protein GI116672268 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.191041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGAGA CCAGCTACGA CTACGTCATC GTCGGTGGGG GAAGTGCCGG TTCCGTGCTG 
GCAAACCGCC TGAGCGCAGG GGGCACGCGC AGCGTCCTGG TTCTGGAAGC GGGACGAAGC
GACTACCCCT GGGATCTGTT CATCCAGATG CCGGCTGCCC TGACCTTCCC CAGCGGGAAT
CCTCTCTATG ACTGGCGCTA CCAGTCGGAT CCGGAGCCGC ATATGGGGGG ACGCCGGGTG
GCCCATGCCC GCGGCAAGGT CCTGGGCGGC TCGAGCTCCA TCAACGGCAT GATCTTCCAG
CGTGGAAACC CGCTGGACTA CGAACGCTGG GGAGCCGACG ACGGGATGGA AACCTGGGAT
TTCGCGCACT GCCTGCCGTA CTTCAACCGG ATGGAAAACG CGCTCGCGGC GGATCCGGAC
GATGACCTCC GCGGCCACTC GGGACCCTTG GTCCTGGAGC GCGGCCCTGC CACCAACCCG
CTGTTCCAAG CCTTCTTCAA GGCAGCACAG GAAGCGGGAT TCCCGCTGAC GGATGACGTG
AACGGCTACC GCCAGGAGGG CTTCGCGGCG TTTGACCGGA ACGTGCACAA GGGGCAGCGG
CTTTCCGCGT CCCGGGCCTA CCTGCGCCCC GGGGCCAAGC GGCCCAACCT GACGGTCCGC
ACCCGCGCCC TGGTCACGAA GGTGAACTTC AAGGGCAACG TTGCCACCGG CGTCACCTAC
CGCCGCAACG GCAGGACGCA CCAGGTGAAC GCCGGAGAGG TGATCCTGTC CGGCGGCGCC
ATCAATACCC CCCAGCTGCT GCAGCTTTCC GGCATCGGGG ACGCCACCCA CCTCAAGTCG
CTCGGCATCA AGCCCGTGGT CCACCTCCCT GGCGTGGGCG AGAACCTGCA GGACCACCTG
GAGGTCTACA TCCAGCACGC CTGCACCCAG CCGGTGTCCA TGCAGCCGAA CCTTGACCTG
TGGCGCTACC CGCTCATCGG CCTCCAGTGG CTCCTGGGCC GCAAGGGTCC CGCGGCCACC
AACCACTTCG AGGGCGGCGG GTTCGTCCGC TCCAACGATG AGGTGGCGTA CCCCAACCTG
ATGTTCCACT TCCTCCCCGT CGCCGTGCGG TACGACGGCC AAAAGGCGGA TGCGAAGCAC
GGCTACCAGG TGCACATCGG CCCCATGTAT TCCGACGCCC GCGGCAGCCT CAAGATCACA
TCCACGGATC CCACCGTGCA CCCCTCCATG GTGTTCAACT ACCTCTCCAC CGACCAGGAC
CGCCGCGAAT GGGTGGAGGC CATCCATATC GCCCGCGACA TCCTCGGCCA GTCCGCCATG
GGCCCCTTCA ACGGCGGGGA GCTTTCCCCT GGCCGGAGTG TCCAGACCGA CGCCGAAATC
CTGGACTGGG TGGCGCGCGA CGCCGAAACA GCCCTGCATC CGTCGTGCAC CGCGAAGATG
GGGCCGGAAT CGGACCCGAT GGCCGTGGTC AATCCGCTCG ACATGAGCGT GCACGGGGTC
AAGGGCCTCC GCGTGGTGGA TGCCTCGGCC ATGCCGTACG TGACCAACGG CAACATCTAC
GCCCCGGTGA TGATGCTCGC CGAGAAGGCA GCCGACCTGA TTGCCGGAAC GGCCCCGTTG
GCCCCGCGGC ATGCCGAGTT CTACCGCCAT GGGCACAGCC CGCTGATGCG GGACCAGGCC
GCCGCCGCAG CGGCGAAGGG CTAG
 
Protein sequence
MTETSYDYVI VGGGSAGSVL ANRLSAGGTR SVLVLEAGRS DYPWDLFIQM PAALTFPSGN 
PLYDWRYQSD PEPHMGGRRV AHARGKVLGG SSSINGMIFQ RGNPLDYERW GADDGMETWD
FAHCLPYFNR MENALAADPD DDLRGHSGPL VLERGPATNP LFQAFFKAAQ EAGFPLTDDV
NGYRQEGFAA FDRNVHKGQR LSASRAYLRP GAKRPNLTVR TRALVTKVNF KGNVATGVTY
RRNGRTHQVN AGEVILSGGA INTPQLLQLS GIGDATHLKS LGIKPVVHLP GVGENLQDHL
EVYIQHACTQ PVSMQPNLDL WRYPLIGLQW LLGRKGPAAT NHFEGGGFVR SNDEVAYPNL
MFHFLPVAVR YDGQKADAKH GYQVHIGPMY SDARGSLKIT STDPTVHPSM VFNYLSTDQD
RREWVEAIHI ARDILGQSAM GPFNGGELSP GRSVQTDAEI LDWVARDAET ALHPSCTAKM
GPESDPMAVV NPLDMSVHGV KGLRVVDASA MPYVTNGNIY APVMMLAEKA ADLIAGTAPL
APRHAEFYRH GHSPLMRDQA AAAAAKG