Gene Caul_3086 details


Gene Information

Locus tag: Caul_3086
Symbol:
ID: 5900541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3347396
End bp: 3348376
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 68%
IMG OID: 641563589
Product: dihydrodipicolinate synthase
Protein accession: YP_001684711
Protein GI: 167647048
COG category: [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase
TIGRFAM ID: [TIGR00674] dihydrodipicolinate synthase
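
The coordinates, gene length, and protein length listed above are mutually consistent, which a short sketch can check. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3347396, 3348376)
print(length)                  # 981
print(protein_length(length))  # 326
```

Both results match the Gene Length and Protein Length fields of this record.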


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.167482
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCGGT TGCGGGACGC CGCCACCGCG ATATTGTCCG CCCCCATTCT CAAGCCGCAA 
CCGTCGCGGC GCGTTTCTCA CGGCAAATTG CAAGTCATGG CTCATTCTCC GTTCAAGGGC
GTCTTTCCGG CGCTGGTCAC GCCTTTCCGC GACGGCGCGG TGGACGAGAA GACCTTCGTG
GCCCTGGTCG AACGCCAGAT CGCTGGCGGC GTCCACGGCC TGGTGCCGGT CGGCACCACC
GGCGAGACCG CCACGCTGAG TCATGACGAG CACAAGCGCG TGGTCGAACT GTGCGTGCAG
ACCGCCCGAG GCCGCGTTCC GGTGATCGCC GGGGCCGGCT CCAACTCGAC CGCCGAGGCG
ATTGAACTGG TCGCCCACGC CAAGGCCGTC GGCGCCGACG CCGCCCTGGT GGTGTCACCC
TACTATGTGC GCCCAAGCCA GGAGGGGATC TACCAGCACT ACAAGGCGAT CAACGACGCC
GTTCAGTTGC CGGTGCTGGT CTACAACGTA CCGGGCCGCA CCGGCTCGGA CGTCGCCAAC
GAGACCCTGG CCCGCCTGGC CCTGCTACCC AACATCGTCG GCATCAAGGA CGCCACCGGC
GACATCACCC GCGCCAGCTT CCAGCGGATC ATGTGCGGTC CAGACTGGGT GATGTTGTCC
GGCGACGACC CCACGGCTCT GGGCTATGTC GCCCACGGCG GCCACGGGGT GATCTCGGTG
ACCTCCAACG TCGCCCCCGA CGCCTGCGCG ACCTTCATCA ACGCCTGCCT GAACGGCGAG
TGGGAGACCG CGCTCTACTG GCAGGACCGC CTGGTGCGGC TGCACAAGGC GCTGTTCCTC
GACGCCTCGC CGTCGCCCAC CAAGTTCGCG ATGGCTCACC TGGGTCTGTG CGAGGCTACC
ACGCGCCTGC CGATCACGCC GTGCAGCGAG GCCGTCCGTC CGGCGATCCT GGAGGCCATG
CGCGAGGCGG GTCTGGTCTG A
 
Protein sequence
MQRLRDAATA ILSAPILKPQ PSRRVSHGKL QVMAHSPFKG VFPALVTPFR DGAVDEKTFV 
ALVERQIAGG VHGLVPVGTT GETATLSHDE HKRVVELCVQ TARGRVPVIA GAGSNSTAEA
IELVAHAKAV GADAALVVSP YYVRPSQEGI YQHYKAINDA VQLPVLVYNV PGRTGSDVAN
ETLARLALLP NIVGIKDATG DITRASFQRI MCGPDWVMLS GDDPTALGYV AHGGHGVISV
TSNVAPDACA TFINACLNGE WETALYWQDR LVRLHKALFL DASPSPTKFA MAHLGLCEAT
TRLPITPCSE AVRPAILEAM REAGLV