Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4634 |
Symbol | |
ID | 5902096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5013681 |
End bp | 5014715 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641565153 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001686252 |
Protein GI | 167648589 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | [TIGR03589] UDP-N-acetylglucosamine 4,6-dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGGCGCT TTTCACCGAA AACCCTCGAT CTGGACGGCA AGGTCATCCT GGTGACCGGC GGCACCGGTT CGTTTGGCCG TCGCTTCATC GAGACCGTCC TGCGGCGCGC CGCCCCCCGG AAGGTGATCA TCTATTCCCG CGACGAATTG AAGCAGAGCG ATATGCAGCT GGAGCTGCGC GAGCAGTTCG GCGAGGCCGT CTACGGCAAG CTTCGGTTCT TCCTGGGCGA CGTGCGTGAT CGCGAGCGCC TGACCCTGGC CCTGCGCGGC GTCGACATCG TCATCCACGC CGCCGCCCTC AAGCAGGTGC CGGCCGCCGA ATACAACCCG TCAGAATGCA TCCACACCAA CGTGCTGGGC GCCGAGAACG TGGTCTGGGC CAGCCTGACC AACCATGTGA AGCAGGTCGT GGCCCTGTCG ACCGACAAGG CCTGCAACCC GATCAATCTA TACGGCGCGA CCAAGCTGGC CTCGGACAAG ACCTTCGTGG CCGCCAACAA CCTGTCGGGC GATATCGGCA CGCGCTTCGC GGTGGTTCGC TACGGCAATG TGGTGGGATC GCGCGGCAGC GTCGTGCCGC TCTACAAGCG CCTGCTGGCT CAGGGCGCGA CCGAGCTGCC GGTCACCGAC GCGCGGATGA CCCGGTTCTG GATCACGCTC AACGAAGGGG TCGAGTTCGT GCTGTCGTCG CTGGAGATCA TGCGCGGCGG CGAGATCTTC GTGCCCAAGA TCCCGTCGAT GACCATGCCC GACCTGGTCA AGGCCATGTC GCCGGACGTC GGCATGAAGA TCGTCGGCAT CCGCCCGGGC GAGAAACTGC ACGAGATGAT GATCAGCGCC GACGACGCCC GCGCCACCGT CGAGCTGGAC GACCGCTACG TGATCGAGCC GACCTTCGTG GAATATCCCC GCGAGCGCTT CGGCCCGGTC GATGGCGCGA CGCCGGTGGC CGAGGGCTTC AGCTACAGCA GCGACAGCAA CGACGACTGG CTCAGCGAGG ACGGCCTGAA CGCCATGCTG GCCGAGAAGG CCTGA
|
Protein sequence | MGRFSPKTLD LDGKVILVTG GTGSFGRRFI ETVLRRAAPR KVIIYSRDEL KQSDMQLELR EQFGEAVYGK LRFFLGDVRD RERLTLALRG VDIVIHAAAL KQVPAAEYNP SECIHTNVLG AENVVWASLT NHVKQVVALS TDKACNPINL YGATKLASDK TFVAANNLSG DIGTRFAVVR YGNVVGSRGS VVPLYKRLLA QGATELPVTD ARMTRFWITL NEGVEFVLSS LEIMRGGEIF VPKIPSMTMP DLVKAMSPDV GMKIVGIRPG EKLHEMMISA DDARATVELD DRYVIEPTFV EYPRERFGPV DGATPVAEGF SYSSDSNDDW LSEDGLNAML AEKA
|
| |