Gene Information |
Locus tag | Caul_3086 |
Symbol | |
ID | 5900541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3347396 |
End bp | 3348376 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641563589 |
Product | dihydrodipicolinate synthase |
Protein accession | YP_001684711 |
Protein GI | 167647048 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR00674] dihydrodipicolinate synthase |
|
|
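The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(3347396, 3348376)
print(length_bp, protein_length(length_bp))
```

With the values listed for Caul_3086 this gives 981 bp and 326 aa, matching the Gene Length and Protein Length fields.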
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.167482 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCGGT TGCGGGACGC CGCCACCGCG ATATTGTCCG CCCCCATTCT CAAGCCGCAA CCGTCGCGGC GCGTTTCTCA CGGCAAATTG CAAGTCATGG CTCATTCTCC GTTCAAGGGC GTCTTTCCGG CGCTGGTCAC GCCTTTCCGC GACGGCGCGG TGGACGAGAA GACCTTCGTG GCCCTGGTCG AACGCCAGAT CGCTGGCGGC GTCCACGGCC TGGTGCCGGT CGGCACCACC GGCGAGACCG CCACGCTGAG TCATGACGAG CACAAGCGCG TGGTCGAACT GTGCGTGCAG ACCGCCCGAG GCCGCGTTCC GGTGATCGCC GGGGCCGGCT CCAACTCGAC CGCCGAGGCG ATTGAACTGG TCGCCCACGC CAAGGCCGTC GGCGCCGACG CCGCCCTGGT GGTGTCACCC TACTATGTGC GCCCAAGCCA GGAGGGGATC TACCAGCACT ACAAGGCGAT CAACGACGCC GTTCAGTTGC CGGTGCTGGT CTACAACGTA CCGGGCCGCA CCGGCTCGGA CGTCGCCAAC GAGACCCTGG CCCGCCTGGC CCTGCTACCC AACATCGTCG GCATCAAGGA CGCCACCGGC GACATCACCC GCGCCAGCTT CCAGCGGATC ATGTGCGGTC CAGACTGGGT GATGTTGTCC GGCGACGACC CCACGGCTCT GGGCTATGTC GCCCACGGCG GCCACGGGGT GATCTCGGTG ACCTCCAACG TCGCCCCCGA CGCCTGCGCG ACCTTCATCA ACGCCTGCCT GAACGGCGAG TGGGAGACCG CGCTCTACTG GCAGGACCGC CTGGTGCGGC TGCACAAGGC GCTGTTCCTC GACGCCTCGC CGTCGCCCAC CAAGTTCGCG ATGGCTCACC TGGGTCTGTG CGAGGCTACC ACGCGCCTGC CGATCACGCC GTGCAGCGAG GCCGTCCGTC CGGCGATCCT GGAGGCCATG CGCGAGGCGG GTCTGGTCTG A
|
Protein sequence | MQRLRDAATA ILSAPILKPQ PSRRVSHGKL QVMAHSPFKG VFPALVTPFR DGAVDEKTFV ALVERQIAGG VHGLVPVGTT GETATLSHDE HKRVVELCVQ TARGRVPVIA GAGSNSTAEA IELVAHAKAV GADAALVVSP YYVRPSQEGI YQHYKAINDA VQLPVLVYNV PGRTGSDVAN ETLARLALLP NIVGIKDATG DITRASFQRI MCGPDWVMLS GDDPTALGYV AHGGHGVISV TSNVAPDACA TFINACLNGE WETALYWQDR LVRLHKALFL DASPSPTKFA MAHLGLCEAT TRLPITPCSE AVRPAILEAM REAGLV
|
| |