Gene Arth_2679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2679 
Symbol 
ID4444733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3003375 
End bp3004373 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content62% 
IMG OID639690499 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_832158 
Protein GI116671225 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0614199 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAAC TCCTTGTCAC CGGCGGTGCC GGCTTCATCG GTTCCAATTT TGTTCACTAC 
GTTCTTGAGA ACACTGATGA TCACGTCACT GTTCTGGACA AGCTGACGTA CGCAGGCAAC
CTGGAATCCC TGAGCGGGCT CCCGGAGGAG CGCTTCCGCT TCGTGCAGGG CGATATCTGC
GACGCCGCAC TGGTGGACAC GCTCGTGGCC GATGCCGACG TCGTGGTCCA CTACGCCGCC
GAGTCGCACA ACGATAACTC GCTGCATGAC CCGCGGCCGT TCCTGGACAC GAACATCATC
GGCACCTACA CGCTGATCGA GGCCGCCCGG AAGCACAACA AGCGCTTCCA CCACATCTCC
ACCGACGAGG TCTACGGGGA CCTGGAACTC GATGACCCGG AGCGGTTCAC GGAAGAGACT
CCGTACAACC CCTCGAGCCC GTACTCCTCC ACGAAGGCCG GCTCTGACCT GCTGGTTCGC
GCCTGGGTCC GTTCCTTCGG GCTGCAGGCG ACCATCAGCA ACTGCTCGAA CAACTACGGC
CCGTACCAGC ACGTGGAGAA GTTCATCCCG CGCCAGATCA CCAACGTGAT CGACGGGATC
CGGCCCAAGC TCTACGGCAA GGGCGAGAAC GTCCGCGACT GGATCCACGC CAACGACCAC
TCCTCGGCCG TGCTGGCCAT CATCGCCAAG GGAAAAATCG GCGAAACCTA CCTGATCGGC
GCGGACGGCG AGAAGAACAA CAAGGACGTC GTGGAGCTCA TCCTCAAGCA CATGGGCCAG
TCCCCGGACG CCTACGACCA CGTCGTGGAC CGCCCCGGCC ATGACCTGCG CTACGCCATC
GACTCCACCA AGCTCCGCAA CGAGCTCGGC TGGGAACCGA AGTTCTCCAA CTTCGACGCC
GGCATCGAGG ACACCATCGC CTGGTACCGC GAGAACGAAA ACTGGTGGCG CCCGCAGAAA
GCCCAGACCG AAGCGAAGTA CAAGGAACAG GGCCAGTAG
 
Protein sequence
MQKLLVTGGA GFIGSNFVHY VLENTDDHVT VLDKLTYAGN LESLSGLPEE RFRFVQGDIC 
DAALVDTLVA DADVVVHYAA ESHNDNSLHD PRPFLDTNII GTYTLIEAAR KHNKRFHHIS
TDEVYGDLEL DDPERFTEET PYNPSSPYSS TKAGSDLLVR AWVRSFGLQA TISNCSNNYG
PYQHVEKFIP RQITNVIDGI RPKLYGKGEN VRDWIHANDH SSAVLAIIAK GKIGETYLIG
ADGEKNNKDV VELILKHMGQ SPDAYDHVVD RPGHDLRYAI DSTKLRNELG WEPKFSNFDA
GIEDTIAWYR ENENWWRPQK AQTEAKYKEQ GQ