Gene Arth_2095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2095 
Symbol 
ID4445364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2362914 
End bp2364551 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content67% 
IMG OID639689903 
Productglucose-6-phosphate isomerase 
Protein accessionYP_831575 
Protein GI116670642 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0166] Glucose-6-phosphate isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.218405 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACAC TCAGCTACGA CGCCACCGGC GCAGCCCGTG AGGCACTTGA GCAGCACCTC 
CCCGCGCTGC TGGAAGACCG CATCGCCACC AGGATTTTCG CGAAGGACCA CACCTTGTGG
GGTCCGGACG CCGAACAGGA ATCCGCTGTC CGGCTGGGCT GGGTCGAGGC GGCCACCGTC
TCGCAGCCTC TGGTCAGTGA CATCCTGGAA CTCCGCGACG CACTGAAGGC CGAGGGTGTC
ACGCGCATTG TCCTCTGCGG CATGGGCGGA TCCTCGCTGG CGCCTGAAGT CATTGCGGGC
ACCGCTGGCG TCGAGCTGAC TGTGCTTGAC AGCACCGACC CCGAACAGGT CAGTGCTGCC
CTTGTGGACC GCCTTGCCGA AACTGCCATT GTCGTTTCGT CCAAGTCCGG CTCCACCGTC
GAAACGGATT CCCAGCGGAG GATCTTCGAG CAGGCCTTCA CCGATGCCGG CGTTGACGCG
AAGAGCCGCA TCGTCATCGT CACTGACCCG GGTTCGCCGC TGGACAAGGC ATCACGTGAG
GCCGGCTACC GCGCCGTCTT CAACGCCGAC CCCAACGTCG GCGGACGCTT CTCGGCACTG
ACCGCTTTCG GACTGGTCCC CTGCGGGCTG GCCGGCGTGG ACATCCAGGC GTTCCTGGAC
GAGGCCGAAG AAGCAGCCGA GATCCTCAAC GAGGACGCCC CCGAAAACAT CGGCCTCGCC
CTGGGGACCG CGCTGGGCGG CACCAACCCG CTGCGCAACA AGATCGTCAT CGCGGAAGAC
GGGTCCGGCA TTGTCGGTTT CGCCGACTGG GCCGAACAGC TCATCGCCGA ATCCACCGGC
AAGCTCGGCA CCGGAGTCCT GCCGGTCGTC GCCGGTCCGT CAGCTCCGGA GGTCACCTCC
GGCGCTGCCG ACATCCTGGT GGTGCGTCTG GTTGCTGCCG ACGCCGACGA CGTCCAGCTC
GGTGATAACG AGGTTGCCAT CGCCGGGGGC CTGGCAACGC AGATGATGGT GTGGGAATTC
GCGACGGCCG TTGCGGGGCG CCTGCTGGGC ATCAACCCGT TTGACCAGCC CGACGTCGAA
GCCGCCAAGG TGGCCGCACG CGGCCTGCTG GACGCGCAGC CGGAACCGAC TCCCGCCAAC
TTCGTTGACG GCGCCATTGA GGTCCGCGGC GGCGACTGGC TGGGAGATGC CTCGACGGCG
TCTGCCGCAG TGTCCGCGTT GCTCGCCCAG CTCGCCGACG ACAGCTACCT CAGCGTCCAG
GCCTACTTCG ACCGGCTGGC ATATGCTCCG TTTGAAGGTG TCCGCGACCA GCTCGCAGCA
GTCAGCGGAC GTCCTGTGAC GTTCGGCTGG GGCCCGCGGT TCCTGCACTC CACTGGCCAG
TTCCACAAGG GCGGACCCGC CATCGGCGTG TTCCTGCAGG TCACTGCCGA GTCCGCAACG
GATCTGTCTA TCCCGGAGCG GCCGTTCACG TTCGGTGAGC TGATCTCGGC GCAGGCCGCC
GGTGACGCCC AGGTTCTCAG CGACCACGGA CGTCCGGTCC TCCGCCTGCA CCTCACTGAC
CGCGCCTCAG GCGTCAAGCA GCTGCAGGAT ATCGTTTCAG AGCTTGCCGG CCAGGCAGCA
TCCCACACTG AAAGCTAA
 
Protein sequence
MSTLSYDATG AAREALEQHL PALLEDRIAT RIFAKDHTLW GPDAEQESAV RLGWVEAATV 
SQPLVSDILE LRDALKAEGV TRIVLCGMGG SSLAPEVIAG TAGVELTVLD STDPEQVSAA
LVDRLAETAI VVSSKSGSTV ETDSQRRIFE QAFTDAGVDA KSRIVIVTDP GSPLDKASRE
AGYRAVFNAD PNVGGRFSAL TAFGLVPCGL AGVDIQAFLD EAEEAAEILN EDAPENIGLA
LGTALGGTNP LRNKIVIAED GSGIVGFADW AEQLIAESTG KLGTGVLPVV AGPSAPEVTS
GAADILVVRL VAADADDVQL GDNEVAIAGG LATQMMVWEF ATAVAGRLLG INPFDQPDVE
AAKVAARGLL DAQPEPTPAN FVDGAIEVRG GDWLGDASTA SAAVSALLAQ LADDSYLSVQ
AYFDRLAYAP FEGVRDQLAA VSGRPVTFGW GPRFLHSTGQ FHKGGPAIGV FLQVTAESAT
DLSIPERPFT FGELISAQAA GDAQVLSDHG RPVLRLHLTD RASGVKQLQD IVSELAGQAA
SHTES