Gene Caul_2085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2085 
Symbol 
ID5899540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2234403 
End bp2235524 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content73% 
IMG OID641562574 
Productcapsule polysaccharide biosynthesis protein 
Protein accessionYP_001683711 
Protein GI167646048 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3563] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.817087 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGACG GCGCCCCGCC CCAAGCGGCG ATCGTCGCCT GCGTCGGGGT CGCGGCCTGG 
AAGCGGGCCT CGGTGGCGCG GCTGATCGCG GCGGACGGCC GAGCAAGCGG ACCGTCCGCG
GACCTCGGTC GCCCAGCCTT CCGCGCGCGC GCCGGCCCGG CCCTGCGGCG GGCTCGCGAC
CTGGGCGGCG CTGTCGGCGT CTGGCCGTCG CGCGCGCCCG CCGACATCGA AAGCCAGGCC
CAGGCGCTGG GCGTGCCCCT GGTCTGCATC GAGGACGGCT TCATCCGCTC GGCGGGCCTG
GGCGCGGAGT GCCGCCCGCC GGCCTCGATC GTCCTTGATC GCGCCGGCGT CCATTTCGAC
CCCCGCCGCC CGAGCGACCT GGAGTTGCAC CTGACGCATG ACCATTTCGG CGAAACCCTG
ACGTCCCGCG CCCAACAGCT GATCGAACGG ATCGTCGCCC TTGGCGTCAC CAAGTACAAT
CTGTCGGGCC AGGCGCCGCC ATTCCGCGGC GGCCGGCGCA CCGTTCTCGT TCCCGGCCAG
GTCGAGGACG ATCTGTCGGT CAAGCTGGGC GGGGCCGGCG TCGCCGGCAA TCTCGACCTT
CTTCGACGGG TCCGTCAGAT CGAGCCGGAC GCGGTTGTGC TCTACCGCCC CCATCCGGAC
GTCGAGGCGG GGTATCGCAA GGGCCTGATC CGCGACGCCG ACGCGCTGCG ATACGTCGAT
CAGGTCCTCC GCGGCCACGC CCTGCCAGCA CTCCTGACCA GCGTCGATGC GGTCCACGTC
CTCACCTCGC TGACCGGTTT CGAGGCCCTG CTGCGGGGGC GGGAGGTCGT CGTCCACGGC
CAGCCCTTCT ATGCCGGCTG GGGTCTGACG CGGGACCTGT CCCCGCCCCC GCGCCGGGGA
CGGCGGCTCG CCCTGGCCGA GTTGGCCGCC GCCGCCCTGA TCCTCTATCC CCGCTATATC
GACCCCATGA CGGGCGAGAC CTGTTCGCCC GAGACCCTTG TCGCTCGCCT GGCCGACCAG
CCAGAGCCGC GACCGGGCCT GCTGCCGGCG CTCCGGCGCC TACAGGTCGG CGCGTTCCGA
TCCCTCGACA TGGCCGCGGC GACGAGCATG GCCAATGGCT GA
 
Protein sequence
MIDGAPPQAA IVACVGVAAW KRASVARLIA ADGRASGPSA DLGRPAFRAR AGPALRRARD 
LGGAVGVWPS RAPADIESQA QALGVPLVCI EDGFIRSAGL GAECRPPASI VLDRAGVHFD
PRRPSDLELH LTHDHFGETL TSRAQQLIER IVALGVTKYN LSGQAPPFRG GRRTVLVPGQ
VEDDLSVKLG GAGVAGNLDL LRRVRQIEPD AVVLYRPHPD VEAGYRKGLI RDADALRYVD
QVLRGHALPA LLTSVDAVHV LTSLTGFEAL LRGREVVVHG QPFYAGWGLT RDLSPPPRRG
RRLALAELAA AALILYPRYI DPMTGETCSP ETLVARLADQ PEPRPGLLPA LRRLQVGAFR
SLDMAAATSM ANG