Gene Caul_4634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4634 
Symbol 
ID5902096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5013681 
End bp5014715 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content66% 
IMG OID641565153 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_001686252 
Protein GI167648589 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID[TIGR03589] UDP-N-acetylglucosamine 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGGCGCT TTTCACCGAA AACCCTCGAT CTGGACGGCA AGGTCATCCT GGTGACCGGC 
GGCACCGGTT CGTTTGGCCG TCGCTTCATC GAGACCGTCC TGCGGCGCGC CGCCCCCCGG
AAGGTGATCA TCTATTCCCG CGACGAATTG AAGCAGAGCG ATATGCAGCT GGAGCTGCGC
GAGCAGTTCG GCGAGGCCGT CTACGGCAAG CTTCGGTTCT TCCTGGGCGA CGTGCGTGAT
CGCGAGCGCC TGACCCTGGC CCTGCGCGGC GTCGACATCG TCATCCACGC CGCCGCCCTC
AAGCAGGTGC CGGCCGCCGA ATACAACCCG TCAGAATGCA TCCACACCAA CGTGCTGGGC
GCCGAGAACG TGGTCTGGGC CAGCCTGACC AACCATGTGA AGCAGGTCGT GGCCCTGTCG
ACCGACAAGG CCTGCAACCC GATCAATCTA TACGGCGCGA CCAAGCTGGC CTCGGACAAG
ACCTTCGTGG CCGCCAACAA CCTGTCGGGC GATATCGGCA CGCGCTTCGC GGTGGTTCGC
TACGGCAATG TGGTGGGATC GCGCGGCAGC GTCGTGCCGC TCTACAAGCG CCTGCTGGCT
CAGGGCGCGA CCGAGCTGCC GGTCACCGAC GCGCGGATGA CCCGGTTCTG GATCACGCTC
AACGAAGGGG TCGAGTTCGT GCTGTCGTCG CTGGAGATCA TGCGCGGCGG CGAGATCTTC
GTGCCCAAGA TCCCGTCGAT GACCATGCCC GACCTGGTCA AGGCCATGTC GCCGGACGTC
GGCATGAAGA TCGTCGGCAT CCGCCCGGGC GAGAAACTGC ACGAGATGAT GATCAGCGCC
GACGACGCCC GCGCCACCGT CGAGCTGGAC GACCGCTACG TGATCGAGCC GACCTTCGTG
GAATATCCCC GCGAGCGCTT CGGCCCGGTC GATGGCGCGA CGCCGGTGGC CGAGGGCTTC
AGCTACAGCA GCGACAGCAA CGACGACTGG CTCAGCGAGG ACGGCCTGAA CGCCATGCTG
GCCGAGAAGG CCTGA
 
Protein sequence
MGRFSPKTLD LDGKVILVTG GTGSFGRRFI ETVLRRAAPR KVIIYSRDEL KQSDMQLELR 
EQFGEAVYGK LRFFLGDVRD RERLTLALRG VDIVIHAAAL KQVPAAEYNP SECIHTNVLG
AENVVWASLT NHVKQVVALS TDKACNPINL YGATKLASDK TFVAANNLSG DIGTRFAVVR
YGNVVGSRGS VVPLYKRLLA QGATELPVTD ARMTRFWITL NEGVEFVLSS LEIMRGGEIF
VPKIPSMTMP DLVKAMSPDV GMKIVGIRPG EKLHEMMISA DDARATVELD DRYVIEPTFV
EYPRERFGPV DGATPVAEGF SYSSDSNDDW LSEDGLNAML AEKA