Gene Caul_3985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3985 
Symbol 
ID5901447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4314749 
End bp4315804 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content71% 
IMG OID641564506 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001685608 
Protein GI167647945 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.59438 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAATC CTGCCTCCGC CGCCCCAGGC GCGGCGACGA CCGCGCACGC CTCCACGCGC 
CGCCTCTTCG TGACCGGCGC CGCCGGATAT GTCGGGCGCA ACCTGGTCCA TTGCTTCCTG
CGCGACGGCG TCGAAGTCGT CGCCCTGGTG CGAACCACCG AAGCCGCCGA GCGCCTGCGG
GCGATGGGCG CCCTGGCGGT GGTCGGCGAC ATCCTTGATC CGGGGATCGG CGCGGCCATG
GCCGGCTGCG ACGCCCTGGT CCATGCGGCG GCCGACACGG ACCACAGTTA CGGCGGCGCG
GCGCAGATGC GAACCAACGC CACGGGAACC GAGACCGTGC TGCGCGCCGC CCGGGTCGCG
GGCGTCCGCC GCGCCATCGT GTTGAGCACG GAATCGGTCC TGGCCGACGG TCGCCCGCTC
AGGAACGTCG ACGAGACGCG CGCCTATCCG ACCCGCCCCG CCGGTGCCTA TTCGCGAAGC
AAGATCGCGG CCGAGAAGAT CGCCCTGTCC CTGAACGACG AGACCTTCGC GGTCATCATC
GTGCGGCCGC GTTTCGTCTG GGGCCGCGAC GACACCACGG CCCTGCCGAT GCTGGTGGAA
GCCGCCCGCT CCGGAGAGCT CGCCTGGATC GACGGCGGCG GCTATCTAAC CTCGACCATC
CACATCGACA ACCTGTGCCA CGGCGTCGAC CTGGCGCTAA AGGCTGGGCG CGGCGGCGAG
ATCTATTTCC TGTCCGACGG CGAGCCGGTC GCGTTCCGGA CGATCGTTTC AGCCCTTCTG
GAGACCCAGG GCGAAGCGGC GCCGGACAAG GTCGCGCCGC GCCCGCTGGT TCGCATGGTC
GCCGCCGTGG GCGACCTGAT CGGCGCGGCG ACGCGCGGTC GAAAGCCTGT CCCGCTCACC
CTGCAAGGCT TCGCCGCTTC GGCCGTCGAG GTGACGCTCG ACATCGGCAA GGCGCGGCGC
GAGCTTGGCT ATGCTCCGGT CGTCTCGATG GCCGAGGGCC TGGCGGAACT GTCCGCTTCG
GCGCGGCGGC GCGGGCTAGG GCGTGCGGAC TGTTGA
 
Protein sequence
MTNPASAAPG AATTAHASTR RLFVTGAAGY VGRNLVHCFL RDGVEVVALV RTTEAAERLR 
AMGALAVVGD ILDPGIGAAM AGCDALVHAA ADTDHSYGGA AQMRTNATGT ETVLRAARVA
GVRRAIVLST ESVLADGRPL RNVDETRAYP TRPAGAYSRS KIAAEKIALS LNDETFAVII
VRPRFVWGRD DTTALPMLVE AARSGELAWI DGGGYLTSTI HIDNLCHGVD LALKAGRGGE
IYFLSDGEPV AFRTIVSALL ETQGEAAPDK VAPRPLVRMV AAVGDLIGAA TRGRKPVPLT
LQGFAASAVE VTLDIGKARR ELGYAPVVSM AEGLAELSAS ARRRGLGRAD C