Gene Caul_3613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3613 
Symbol 
ID5901068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3898451 
End bp3900253 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content69% 
IMG OID641564124 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001685238 
Protein GI167647575 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.772791 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCTG CCGCCGTTCC GCCCCGCGCC CTTCGTTCGC GCGCCTGGTT CGACAATCCG 
GACAATGCGG ACATGACGGC GCTCTACCTG GAGCGTTACC TGAACTATGG CCTGTCGCTG
GAAGAACTGC AGTCAGGCAA GCCGATCATC GGCATCGCCC AGACCGGCAG CGACCTGTCG
CCCTGCAACC GCCACCACCT GGTTCTGGCC GAACGCGTGC GCGAAGGCAT CCGCACGGCC
GGCGGCATCG CCCTGGAGTT TCCGGTCCAT CCGATCCAGG AGACCGGCAA GCGCCCGACC
GCGGGCCTGG ACCGCAACCT CTCCTACCTC GGTCTGGTCG AAATCCTGTA CGGCTATCCG
ATCGACGGCG TGGTTCTGAC CATCGGCTGC GACAAGACCA CGCCAGCCTG TCTGATGGCC
GCCGCCACCG TCAACATCCC GGCCATCGCC CTGTCGGTCG GACCGATGCT GAACGGCTGG
CACAAGGGTG AGCGCACGGG CTCGGGCACC ATTGTCTGGA AGGCTCGCGA AATGCTGGCG
GCCGGCGAGA TCGACCGCGC CGGCTTCATC AAGCTGGTGG CCAGTTCCGC CCCCTCGACC
GGCTATTGCA ACACCATGGG CACGGCCACG ACCATGAACT CGCTGACCGA GGCCTTGGGC
ATGTCGCTGA CGGGCTCGGC GGCGATTCCC GCCCCGTACC GCGACCGGCA ACAGAACGCC
TACGAGACCG GCCTGCGGAT CGTCGAGCTG ACCGAGCAGG ACATCAAGCC GTCCGACATC
CTGACCCGCG ACGCCTTCCT CAACGCCGTG GTCGTCAATT CGGCGATCGG CGGCTCGACC
AACGCCCCGA TCCACCTCAA CGCCCTGGCG CGCCATATCG GCGTCGAGCT CAGCGTCGAC
GACTGGCAGG CCTATGGCGA AGAGGTGCCG CTGCTGGTCA ACCTGCAGCC GGCCGGCGAA
TATCTGGGCG AGGACTATTA CCGGGCCGGC GGCGTGCCGG CCGTGGTCAA CCAGTTGATG
GGCCAGGGTC TGATCCGCGA GGACGCCCTG ACCGTCTCGG GCCAGACCCT GGGCGAGGCC
TGCCGGAACG CGGCGATCGA GGACGAGGCG GTGATCCGCC CCTTCGACAA GCCGCTGGTC
GAGCGCGCGG GCTTCGTGGT CATGCGCGGC AACCTGTTCA ACTCGGCGAT CATGAAGACC
AGCGTGATCA CCGCCGAGTT CCGCGACCGC TATCTGTCGA ACCCCGACGA CCCGGACGCC
TTCGAGGGCG AGGCGGTGGT GTTCGACGGA CCCGAGGACT ACCACCGCCG CATCGACGAT
CCGGCGGTCG GGATCACCGA GCGCAGCGTG CTGTTCATGC GCGGGGCCGG GCCGATCGGC
TATCCTGGCG CGGCCGAGGT CGTGAACATG CGGGCCCCGG ACTACCTGAT CAAGCGCGGG
ATCCACCAAC TGCCCTGCAT CGGCGACGGG CGCCAGTCGG GCACCTCGGG CTCGCCCTCG
ATCCTCAACG CCTCGCCGGA GGCGGCGGCC GGCGGCGGCC TGGCCCTGCT GAAGTCCGGC
GACAAGGTGC GGGTCGATCT GCGCAAGTCG CGGGTCGATG TGCTGGTCAC GCCCGAAGAG
GTGGTCGCGC GGCGCGCCGC GCTCGAGGCG GCCGGCGGCT ACGCCTATCC CGAAAGCCAG
ACGCCCTGGC AGGAGATCCA GCGCGGCATC ATCGGCCAGA TGGACACCGG CGCGGTGCTG
GAGCCGGCGG TCAAGTACCA GCGCATCGCC CAGACCAAGG GCCTGCCGCG GGACAACCAC
TGA
 
Protein sequence
MSSAAVPPRA LRSRAWFDNP DNADMTALYL ERYLNYGLSL EELQSGKPII GIAQTGSDLS 
PCNRHHLVLA ERVREGIRTA GGIALEFPVH PIQETGKRPT AGLDRNLSYL GLVEILYGYP
IDGVVLTIGC DKTTPACLMA AATVNIPAIA LSVGPMLNGW HKGERTGSGT IVWKAREMLA
AGEIDRAGFI KLVASSAPST GYCNTMGTAT TMNSLTEALG MSLTGSAAIP APYRDRQQNA
YETGLRIVEL TEQDIKPSDI LTRDAFLNAV VVNSAIGGST NAPIHLNALA RHIGVELSVD
DWQAYGEEVP LLVNLQPAGE YLGEDYYRAG GVPAVVNQLM GQGLIREDAL TVSGQTLGEA
CRNAAIEDEA VIRPFDKPLV ERAGFVVMRG NLFNSAIMKT SVITAEFRDR YLSNPDDPDA
FEGEAVVFDG PEDYHRRIDD PAVGITERSV LFMRGAGPIG YPGAAEVVNM RAPDYLIKRG
IHQLPCIGDG RQSGTSGSPS ILNASPEAAA GGGLALLKSG DKVRVDLRKS RVDVLVTPEE
VVARRAALEA AGGYAYPESQ TPWQEIQRGI IGQMDTGAVL EPAVKYQRIA QTKGLPRDNH