Gene Caul_4609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4609 
Symbol 
ID5902071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4985312 
End bp4986367 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content69% 
IMG OID641565128 
Productxylose isomerase domain-containing protein 
Protein accessionYP_001686227 
Protein GI167648564 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1082] Sugar phosphate isomerases/epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0474989 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACGC TCAAGGGACC CGGCATCTTC CTGGCCCAGT TCATCGGCGC CGAGCCGCCG 
TTCGACAAGC TGGAGACCAT GGCCGCCTGG GTCGCCGATC TCGGCTATGT CGGGGTGCAG
ATGCCGACCG GCGGCGCGGA CTCGTTCTTC GACCTGGCCC TGGCCGCCGA GAGCCAGACC
TATTGCGACG ACATCGCCGG GTTGCTGGCC GGTCATGGCC TGCGGATCAC CGAGCTGTCG
ACCCACCTGC AGGGTCAGCT GGTCGCCGTG CACCCGGCCT ATGACGAGCT GTTCGACGGC
TTTGCCCCGC CTGAACTGCG CGGCAGGCCA GTCGAGCGCC AGGTCTGGGC GGTGGGCCAG
CTGAAGGCCG CGGCCGTCGC CAGCCGGCGG CTCGGGCTGA ACGCCCACGC CACCTTCTCC
GGCGCCCTGG CCTGGCCGTA CTTCTATCCC TGGCCGCAGC GCCCAGCGGG TCTGATCGAG
GAGGCGTTCG CCGAACTGGG CCGCCGCTGG AAGCCGATCC TGGACGTCTT CGACAACGAG
GGCGTCGATG TCTGCTACGA GATCCATCCG GGCGAGGACC TGCACGACGG CGCGACGTTC
GAGCGGTTCC TCGACGAGGT CGGCGGCCAC GCGCGGGCCA ATATTCTCTA TGATCCCAGC
CACTTCGTGC TGCAGCAGCT GGACTACCTA GGCTACATCG ACCGCTATCA CGAGCGGATC
CGCGCCTTCC ACGTCAAGGA CGCCGAGTTT CGGCCGTCGG CCCGGTCGGG CGTCTATGGC
GGCTACCAAG GCTGGGCCGA GCGACCCGGC CGCTTCCGAT CGCTGGGCGA TGGCCAGGTG
GACTTCAAGG CGATCTTCTC CAAGCTGGCC CAGTACGACT ATCCCGGCTG GGCGGTGCTG
GAGTGGGAGT GCGCGCTGAA ACATCCGGAG CAGGGCGCCC GCGAAGGCGC GCCGTTCATC
CGCGACCACA TCATCCAGGT GACCGACCGG GCGTTCGACG ACTTCGCCGG CAGCGCGCCC
GACGGCGACC GCAATCGCCG GCTCCTGGGC CTGTAG
 
Protein sequence
MKTLKGPGIF LAQFIGAEPP FDKLETMAAW VADLGYVGVQ MPTGGADSFF DLALAAESQT 
YCDDIAGLLA GHGLRITELS THLQGQLVAV HPAYDELFDG FAPPELRGRP VERQVWAVGQ
LKAAAVASRR LGLNAHATFS GALAWPYFYP WPQRPAGLIE EAFAELGRRW KPILDVFDNE
GVDVCYEIHP GEDLHDGATF ERFLDEVGGH ARANILYDPS HFVLQQLDYL GYIDRYHERI
RAFHVKDAEF RPSARSGVYG GYQGWAERPG RFRSLGDGQV DFKAIFSKLA QYDYPGWAVL
EWECALKHPE QGAREGAPFI RDHIIQVTDR AFDDFAGSAP DGDRNRRLLG L