Gene Caul_2118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2118 
Symbol 
ID5899573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2281158 
End bp2282489 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content70% 
IMG OID641562607 
Productcapsule polysaccharide biosynthesis protein 
Protein accessionYP_001683744 
Protein GI167646081 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3562] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.107224 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.128982 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGC CGCGTCGCTG CTCCTGGCGC GGACCCCGAC CCGTGGCCCA CACCTCGCCG 
ATCCGGCCCG GCCGACGGGA AATCCTGTTC CTGCAGGGAC CGCCCGGCTG CTTTTTCAGG
GAATTGGCCC GACAGGTCGA GGCCGAGGGG CACGGCGTCC ATCGCATTAA TTTCAACGGC
GGCGACGCCC TGGACTGGAG GGGCGGCGGG ATCAATTACC GCGGCGGCGT CGGGGCGTGG
CCGAGCTTTT TGGCGCGGTT GCTGCTGGAA CGCGACATCA GCGACGTGGT GCTGTTTGGC
GACTGTCGCC CCATCCATCG CGCGGCGCGC GGCGTGGCGG CGGGCCTGGG CCTGACGGTT
CATGTCTTCG AGGAAGGTTA TATACGCCCT AATTGGGTGA CCCTGGAGCG AGGCGGGGTC
AACGGGTTCT CCACCCTGTC GTCCGATCCG CAATGGTATC TCGACGCCGC CGAGCGGCTG
GCGCCCATAC CCGAACACGG TCCCTTGCCG TCGGCGATCG ATCGGCGCGC CCGGGCCAGC
GTCGCCTATC ATCTGGCGAC CGTCCTGTCG GCGGCGGCCT TTTCGGGCTA TCGCAACCAT
CGGCCCTGGC ACCCGGCGGC CGAGGCCGCG GGCTGGGCCG GGCGTTGGAT CCGGCGGCGC
CTGGGCGGCG CAAATCCCGA GCCAAGCCTG GATGGGACGC CCTACTTCCT GCTGCCGCTG
CAACTGGACT CCGACTACCA ACTCCGCACG CACTCGGACT ACGAGGGCAT GCAGCCGGCC
CTGGCCCAGG TGATCGCCTC GTTCGCCCGC CATGCGCCGG TCAACGCCAG CCTGGTCGTC
AAGGAACACC CGTTGGACAA CGGCCTGCGC GACTGGCGGC GCCGGACCTT GGATTATGCG
CGCGCCCTCA ACGTGTCCGA CCGCGTGGTG TTTCTCGACA CGGGGGATAT CGACACCCTG
GTTGGCGACG CCCAGGGGGT GGTGACCATC AACAGCACCA CCGGGACCCT GGCGCTCGCG
GCGGGCGTGC CCGTGGCCAC CCTGGGTCGC GCGATCTACA ATATCGCCGG CCTGACTCAT
CGCGGCCCGC TCGACACCTT CTGGCGGACG CTGACCAAGC CCGACCCTCG CCTCTACGAG
GCCTTCCGCC GGGTGCTGGC CAGCCGCTGC CTGCTGTGGG GCGGGTTCTA CGACCTGGCG
ACGCGTCAGG CCCTGGTCCG GGCGGCGACC GAGCGCATGC TGGGACCGCG ATCCGACACG
GTCGCCGCGC CCAAGGGCGC GCCGCCGTTC GTCAGGAATC CGGCGCCGCC AGCCCTGATC
GCCGCCGAAT GA
 
Protein sequence
MTAPRRCSWR GPRPVAHTSP IRPGRREILF LQGPPGCFFR ELARQVEAEG HGVHRINFNG 
GDALDWRGGG INYRGGVGAW PSFLARLLLE RDISDVVLFG DCRPIHRAAR GVAAGLGLTV
HVFEEGYIRP NWVTLERGGV NGFSTLSSDP QWYLDAAERL APIPEHGPLP SAIDRRARAS
VAYHLATVLS AAAFSGYRNH RPWHPAAEAA GWAGRWIRRR LGGANPEPSL DGTPYFLLPL
QLDSDYQLRT HSDYEGMQPA LAQVIASFAR HAPVNASLVV KEHPLDNGLR DWRRRTLDYA
RALNVSDRVV FLDTGDIDTL VGDAQGVVTI NSTTGTLALA AGVPVATLGR AIYNIAGLTH
RGPLDTFWRT LTKPDPRLYE AFRRVLASRC LLWGGFYDLA TRQALVRAAT ERMLGPRSDT
VAAPKGAPPF VRNPAPPALI AAE