Gene Arth_2543 details

Gene Information

Locus tag: Arth_2543
Symbol:
ID: 4444951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2853293
End bp: 2854426
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 66%
IMG OID: 639690360
Product: glucose sorbosone dehydrogenase
Protein accession: YP_832022
Protein GI: 116671089
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
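
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2853293, 2854426)
print(length)                  # 1134
print(protein_length(length))  # 377
```

Both results match the reported Gene Length (1134 bp) and Protein Length (377 aa).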


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0277395
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCGGG AGGGAACGGG GCCTGCGTCC AGGCCTGCAC TCCGCCCAAT GCCTGCCGCC 
GCCCTCGTAT GGACGCTGGT TTTGTCCGCC TGCACCGGAG GCGGGGACGG CGGTCCCACG
GGCACCACGG GTTCAACCAC GGGCGGCGAC GAGGCGTCCC GGGAACCCCG CGTACTCCGG
ACGGTGGAAG GACTGCAGCT GCCCTGGTCG GCCGTGTTCC TGCCGGATGG CACAGCCCTC
ATTTCTGAAC GTGACAGCGG CGACGTCAAA GCGGTCAAAG ACGGCGGGAC TACGCTGCTA
GGCAACATCC CCGGCGTTGT TCCCGGAGGT GAAGGCGGCC TCCTGGGGCT GGCGGTGTCA
CCGAGCTACG TTTCGGACAA GTCGATCTTT GCCTATTTCA CCGCCCGGGC GGACAACAGG
ATTGCCCGCC TCACGCTGAC TGAGGCCGAG CCCGGGGGCG CGCTGAGGCT TGGGCCGCCG
GAGATAATCT TCTCCGGCAT CCCCAAGGCG TCAACCCACA ACGGTGGCCG CATACGTTTT
GGGCCGGACG GGAACCTCTA TGTGGGAACC GGGGATTCGC AGCGGCGCGA ACAGCCGCAG
GACCCGAACG CGCTGGGCGG CAAGATCCTC CGGATCACTG CTGACGGCAA GCCGGCGCCG
GGTAACCCGT TTGGCGACAA CCCGGTCTAC AGCCTTGGGC ACCGGAACGT GCAGGGCCTC
GACTGGGATG ACGCGGGCAG GCTGTGGTCC AGCGAGTTCG GGCCCACTGT GGACGACGAA
CTGAACCTGA TCCAGCCGGG CGGAAATTAC GGCTGGCCGG AGGTCACCGG GGCACCCGGC
AAGCCGGGCT TTATTGATGC CAAAGTGGTA TGGCCTTCCA CCGCGGAATC TTCCCCGAGC
GGACTCGAGG TCGTCGGGTC CACGGCTTAC CTCGGGGCCC TACGGGGCCA GCGGCTGTGG
GCCATTCCCC TTGACGGCGA AAATGCAGGC AAACCTGTGA GCCATTTCAC AGCGAGGTTC
GGCCGGATCC GCGACGTTTC GCTCGCCCCT GACGGCACTT TGTGGATGCT CACCAACAAC
CAAAACCCTG ATTCTGCGCT GATTTTGGCG CCTCCGGCCA AGGCAGGGAG CTGA
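
The GC content reported for this gene can be recomputed from the sequence as the fraction of G and C bases. The following is a minimal sketch: `gc_content` is a hypothetical helper name, and it assumes the input contains only nucleotide letters plus the whitespace used for the 10-base grouping.

```python
def gc_content(sequence: str) -> float:
    """Return the G+C fraction of a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence, used here only as a small example input.
first_line = "ATGAATCGGG AGGGAACGGG GCCTGCGTCC AGGCCTGCAC TCCGCCCAAT GCCTGCCGCC"
print(f"{gc_content(first_line):.0%}")
```

Passing the full 1134 bp sequence instead of a single line gives the gene-wide value to compare against the table.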
 
Protein sequence
MNREGTGPAS RPALRPMPAA ALVWTLVLSA CTGGGDGGPT GTTGSTTGGD EASREPRVLR 
TVEGLQLPWS AVFLPDGTAL ISERDSGDVK AVKDGGTTLL GNIPGVVPGG EGGLLGLAVS
PSYVSDKSIF AYFTARADNR IARLTLTEAE PGGALRLGPP EIIFSGIPKA STHNGGRIRF
GPDGNLYVGT GDSQRREQPQ DPNALGGKIL RITADGKPAP GNPFGDNPVY SLGHRNVQGL
DWDDAGRLWS SEFGPTVDDE LNLIQPGGNY GWPEVTGAPG KPGFIDAKVV WPSTAESSPS
GLEVVGSTAY LGALRGQRLW AIPLDGENAG KPVSHFTARF GRIRDVSLAP DGTLWMLTNN
QNPDSALILA PPAKAGS
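
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. As a hedged illustration, the sketch below builds a codon table in pure Python and translates until the first stop codon. It relies on table 11 sharing its codon-to-amino-acid assignments with the standard code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; NCBI
# translation table 11 uses the same codon-to-amino-acid assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop grouping spaces and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence.
print(translate("ATGAATCGGG AGGGAACGGG G"))  # MNREGTG
```

The output matches the first seven residues of the protein sequence; translating the complete gene sequence the same way allows a full comparison.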