Gene Arth_1854 details

Gene Information

Locus tag: Arth_1854
Symbol:
ID: 4445613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2085208
End bp: 2086812
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 64%
IMG OID: 639689669
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_831341
Protein GI: 116670408
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
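
The coordinates and lengths listed above agree with one another, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; the page states neither, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS includes its stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_length = span_length(2085208, 2086812)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints: 1605 534
```

Under these assumptions, the 1605 bp span gives 535 codons, and dropping the stop codon leaves the 534 aa reported as the protein length.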


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.138166
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCCTGGG AATCAATTGA CGAGGCACCG TGTCCGGCAG CGCCGGCACA AGCCGCGCAC 
CCGGACGAGG TGATCCGGAC TGACATTGCC ATCATCGGAT CAGGCATGGG CGGAGGCACC
ATGGCCTACG CCCTCAAGGA CTCCGGCGCG AAGGTCCTGG TCATCGAACG CGGCCACCGC
CTCCCTGTTG AACCGGCAAA TTCCGACCTG GACGAGGTAT ACCTCAAGGG CCGCTACAAA
AACGCCGACA CCTGGTACAA CGGCCGCACC GGGGCACCCT TCAAACCGGG CGTCTTCTAC
TGGGTGGGAG GCAACACCAA GGTCTACGGC GCCTGCCTGC CGCGTTTCCG CGCCAGCGAC
TTCGCGGAAA CCCGGCACGC GGACGGAATC TCGCCGGCCT GGCCCTTCAG CTACGAAGAC
CTCGAACCGT ACTACACCCG TGCCGAACAG CTGTTCCAGG TCCGCGGCCA GATCGGCGAG
GACCCCACCG AACCCATGCA CTCAGAGGCT TACCCGCACG CGCCGCTAGA CCACGAACCA
GCCATCCAGT CACTGGCGGA TTCCTTCCGC CGGCAGGGCC TGCATCCGTT CCGAATGCCC
AACGGGCTGG AGGCCGAAAC CCAGGACAAA CGCGCACTGT GCGCCACCAG CGACGGTGCG
CCGAACGAAG CCGGGTTAAA GTCCGACGCC GAAAAGGTCG CCATCAATCC GGCGCTGGCT
GCCGGCGTCG AACTTCTAGC CGATACCAAA GTCATCCGGC TCCTCACCGG TGATGACGGC
CGGACTGTCG TCGCGGCCCT CGCCGAGCAG AACGACCGCA TCATCCGCAT CGAAGCGGAC
AAATTCATCC TCGCCGCCGG GGCCGTGAAC TCGGCAGCGC TCCTGCTGAA CTCGGCCACT
CCCGAGATTC CTGGCGGCCT GGCCAACTCC TCCGGACTCG TGGGCCGGAA CTACATGGTG
CACAACAGCA CCTTCTTCAT CGGCATCAAC CCGTTCAAAG TCAACCGGAC GCTGTGGCAG
AAAACCCTGG GTCTCAACGA TTGGTACGAA GCCGGACCCG CCAACCAGTT CCCCTTGGGG
AACCTGCAGA TGCTCGGAAA ACTACGTGCC AGCATGCTCA AGATGGCCAG GCCCTGGGCA
CCAACCTGGC TCCTGAAAGC CATGTCGGAC CGCAGCATCG ACATTTACCT CACCACCGAG
GACCTCCCCC GGCGGAGCAA CGGCATCAGC GTTGTCAACG GCAGGATCAA CGTCTGGTGG
AAACCGAATA ACCTCGGCCC CCACAAGGAA TTGGTCCACC GGATCAGCAA GGCCGTCCGG
AAAGCCGGCT ACCCAGCCAT CTTCACCGAG CGCATGGGCA TCGAAACCAA CTCCCACCAG
TGCGGAACAG CCGTGGCAGG ACACGATCCC TCCACCAGCG TTCTCACCCC TGACTGCCGA
GCCCATGACT TGGACAACCT CTGGGTGGTC GACAGCTCCT TCTTTCCCTC ATCGGCGGCA
CTCAACCCGG CGCTGACCAT CGCAGCCAAT GCCCTCCGCG TGGCGGACAC CATTCTCGCG
GCCCGCGACC CGGCCGCCCA ACGGCTCTCC GCAGGAAAGG ATTGA
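
A GC content figure such as the one in the Gene Information table can be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted in as a string in the 10-base grouped layout used here. The page does not say how it rounds its value or whether the stop codon is counted, so this helper is not necessarily the method used to produce that figure.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Spaces and line breaks from the grouped layout are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First 10-base group of the gene sequence, used as a small check.
print(gc_percent("GTGTCCTGGG"))  # prints: 70.0
```

Passing the full gene sequence to `gc_percent` gives the gene-wide value to compare against the table.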
 
Protein sequence
MSWESIDEAP CPAAPAQAAH PDEVIRTDIA IIGSGMGGGT MAYALKDSGA KVLVIERGHR 
LPVEPANSDL DEVYLKGRYK NADTWYNGRT GAPFKPGVFY WVGGNTKVYG ACLPRFRASD
FAETRHADGI SPAWPFSYED LEPYYTRAEQ LFQVRGQIGE DPTEPMHSEA YPHAPLDHEP
AIQSLADSFR RQGLHPFRMP NGLEAETQDK RALCATSDGA PNEAGLKSDA EKVAINPALA
AGVELLADTK VIRLLTGDDG RTVVAALAEQ NDRIIRIEAD KFILAAGAVN SAALLLNSAT
PEIPGGLANS SGLVGRNYMV HNSTFFIGIN PFKVNRTLWQ KTLGLNDWYE AGPANQFPLG
NLQMLGKLRA SMLKMARPWA PTWLLKAMSD RSIDIYLTTE DLPRRSNGIS VVNGRINVWW
KPNNLGPHKE LVHRISKAVR KAGYPAIFTE RMGIETNSHQ CGTAVAGHDP STSVLTPDCR
AHDLDNLWVV DSSFFPSSAA LNPALTIAAN ALRVADTILA ARDPAAQRLS AGKD