Gene Caul_4688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4688 
Symbolpgi 
ID5902150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5068964 
End bp5070583 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content70% 
IMG OID641565207 
Productglucose-6-phosphate isomerase 
Protein accessionYP_001686306 
Protein GI167648643 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0166] Glucose-6-phosphate isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGATC TCGCCGCCGC CTGGACCCGC CTGGAAGCCG CCGCCAAGGT CGCCGGCGAA 
AAGCGCATCG CCCTGATGTT CGAGGCCGAG CCGGGGCGGC TCGCGGCGCT GACCTTGAAT
GTCGCCGGCC TGCATATCGA CCTTAGCAAG CAGGCCTGGG ACGAGGCCGG TCTGGAGGCC
GCGCTCGACC TGGCCCACGC GGCCGACGTC GAGGGCGCGC GGACCCGGAT GTTCGGCGGC
GAGGCGATCA ATTCCTCCGA AGGCAGGGCC GTGCTGCACA CCGCCCTGCG CGCCGCCAAG
GACGCCGACG TCCGGGCCGG CGGCGTTCCG GTGATGGCCG AGGTCGAGGC CGTCCGCGTC
CGGATGAAGG CCTTCACCGA GGCCGTGCGC TCCGGCGCGA TCAAGGGCGC CACGGGTAAG
CCGTTCAAGG CCATCCTGCA CATCGGCATC GGCGGCTCGG ACCTTGGCCC CCGCCTGCTG
TGGGACGCCC TGCGCCCCAT CAAGCCGACG ATCGACCTGC GCTTCGTGGC TAATGTGGAC
GGGGCCGAGT TCGCCCTGAC CACGGCCGAC CTCGACCCGG CCGAGACCCT GGTGATCGTG
GTCTCCAAGA CCTTCACCAC CCAGGAGACC CTGGCCAACG CCGCCGCGGC CCGCGCCTGG
CTGTCCGCCG CCCTGGGCGA GCAGGGCGCC AACCAGCATC TGGCGGCGAT CTCCACCGCC
CTGGACAAGA CTGCGGCCTT CGGCGTCGCC GATGACCGGG TGTTCGGCTT CTGGGACTGG
GTCGGCGGCC GCTATTCGCT GTGGTCGTCG GTCAGCCTGT CGGTGGCCGT GGCCTGCGGT
TGGGAGGCGT TCGAAGGGTT CCTTCAGGGC GGCGCCGCCA TGGACGCCCA CTTCCGCGAC
GCGCCGCTCG AGAAGAACGC CGCGGTGCTG ATCGCCCTGG CCCAGATCTT CAACCGCAAT
GGCCTGGACC GCCGCGCCCG CTCGGTCGTG CCCTATTCGC ACCGCCTGCG CCGCCTGGCC
TCGTTCCTCC AGCAACTGGA GATGGAGAGC AACGGCAAGT CGGTCGGGCC CGACGGCCAA
GCCGTCAAGC ACGGCACCGC CACGGTGGTG TTCGGCGACG AAGGCGCCAA TGTCCAGCAC
GCCTATTTCC AGTGCATGCA CCAGGGGACC GACATCACCC CGCTGGAGTT CGTCGCCCTG
GCCCAGTCCG ACGAGGGACC GGCCGGCATG CACGCCAAGC TGCTGTCCAA CGTCCTGGCC
CAGGCCGAGG CGCTGATGGT CGGGCGCACC ATCGAAGACG TCCGCACCGA GCTGGTCGCC
AAGGGCGTCT CCGAAGCCGA GATCGCGACC TTGGCCCCGC AGCGCGCCTT CGCCGGCAAC
CGGCCGTCCA CGATGGTGGT GCTGGACCGC CTGACGCCCC AGACCTTCGG CGCCCTGATC
GCCCTCTATG AGCACAAGAC CTTCGTCGAG GGCGTGATCT GGGGCGTCAA CAGCTTTGAC
CAGTGGGGCG TCGAGTTGGG CAAGGTGATG GCCGGCCGCA TCCTGCCGGA ACTGGAGAGC
GGCGCGGCCG GTCGCCACGA TCCGTCGACG GCGGCGCTGA TCGAGCGGTT GAAGCTCTAG
 
Protein sequence
MADLAAAWTR LEAAAKVAGE KRIALMFEAE PGRLAALTLN VAGLHIDLSK QAWDEAGLEA 
ALDLAHAADV EGARTRMFGG EAINSSEGRA VLHTALRAAK DADVRAGGVP VMAEVEAVRV
RMKAFTEAVR SGAIKGATGK PFKAILHIGI GGSDLGPRLL WDALRPIKPT IDLRFVANVD
GAEFALTTAD LDPAETLVIV VSKTFTTQET LANAAAARAW LSAALGEQGA NQHLAAISTA
LDKTAAFGVA DDRVFGFWDW VGGRYSLWSS VSLSVAVACG WEAFEGFLQG GAAMDAHFRD
APLEKNAAVL IALAQIFNRN GLDRRARSVV PYSHRLRRLA SFLQQLEMES NGKSVGPDGQ
AVKHGTATVV FGDEGANVQH AYFQCMHQGT DITPLEFVAL AQSDEGPAGM HAKLLSNVLA
QAEALMVGRT IEDVRTELVA KGVSEAEIAT LAPQRAFAGN RPSTMVVLDR LTPQTFGALI
ALYEHKTFVE GVIWGVNSFD QWGVELGKVM AGRILPELES GAAGRHDPST AALIERLKL