Gene Caul_1805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1805 
Symbol 
ID5899260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1907601 
End bp1908758 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content69% 
IMG OID641562295 
Productaldose 1-epimerase 
Protein accessionYP_001683432 
Protein GI167645769 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2017] Galactose mutarotase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.122387 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTCG AAATAAGCCG TCGCCGCGCC GCGCGCCCTT CCCTGGCCCT GCTGCTGGCC 
GTCGGCCTGA TTTGCGCCAG CGCCGCCTCG GCCCTCGCCG CCGAAGCCAG CCGCGCGCCC
TACGGCGTGA CGGCGGCCGG CGCGCCGGTC GAGGTCTTCA CCTTGAAGAA CGACCACGGC
ATGACCGTGA AGGTGCTGTC GTACGGCGGG ATCATCACCC AGGTCGATGT CCCCGACCGC
AAGGGCGAGG TCAAGAACGT CGTCCTGGAA CTGGCCGACC TGAAGGCCTA CGAGGGCCGG
GCCAATTTCA GCTCGCTGCT CGGTCGCTAT GCCAACCGGA TCTCGAACGG CGGCTTCACC
CTGGACGGCG TGCGCCACGA CCTGCCCAGC AGCGCCGACG GCGTCTCCTC GCACGGTGGT
TCCACGGGCT TTTCCACCCG CCTCTGGACC GGGACGCCCT TCAAGCGCCA TGGCCAGGCG
GGCGTGACCC TGGCCTACAC CGCCGTCGAT GGCGAAGGCG GCTATCCCGG GACTTTGAAG
GTCGCCGTGA CCTACACCGT CACCCGCCGC GACGCCCTGC GGATCGACTA CCGCGCCACC
ACCGACAAGC CGACGGTGAT CAACCTCAGC CATCACGCCT ATTTCAACCT GGCGGGCGCC
GGCACGGTCC ATGACCAGAC CGCTCAGGTG CTAGCCCAGG CCTTCACCCC GATCAACGCC
CGCAAGCTGC CGACCGGCGA GGTCGCCCCC GTGGCGGGCA CGGCCCTGGA CCTGCGCCAG
CCGGCGCGCA TCGGTGATCG GGTCACGGCC GATGACCCGC AGATCAAGCT CGCCAACGGC
TTCGACCACA ACTTCGTGGT CGACGGCGGC GGACGCGGCA AGCTGGTTCC CGCCGTTCGC
ATGGCCGACC CGGCCAGCGG GCGCACCCTG GAGGTCGCCA CCACCCAGCC GGGCGTCCAA
TTGTACGCCG CCAACAGCTT CAACGGCACG CTCAAGACCC CCGATGGACG CCCGCTGGAC
AGGGGCGCCG GCCTGGCGAT CGAGACCCAG CACTTCGCCG ACAGCCCCAA CCATCCCAAC
TTCCCCTCGA CCGTCCTGCG GCCCGGGCAG GTGTTCAAGC AAACGACCGA ATTCCGGTTT
GGGGTGGCGA AACAATAG
 
Protein sequence
MAVEISRRRA ARPSLALLLA VGLICASAAS ALAAEASRAP YGVTAAGAPV EVFTLKNDHG 
MTVKVLSYGG IITQVDVPDR KGEVKNVVLE LADLKAYEGR ANFSSLLGRY ANRISNGGFT
LDGVRHDLPS SADGVSSHGG STGFSTRLWT GTPFKRHGQA GVTLAYTAVD GEGGYPGTLK
VAVTYTVTRR DALRIDYRAT TDKPTVINLS HHAYFNLAGA GTVHDQTAQV LAQAFTPINA
RKLPTGEVAP VAGTALDLRQ PARIGDRVTA DDPQIKLANG FDHNFVVDGG GRGKLVPAVR
MADPASGRTL EVATTQPGVQ LYAANSFNGT LKTPDGRPLD RGAGLAIETQ HFADSPNHPN
FPSTVLRPGQ VFKQTTEFRF GVAKQ