Gene Caul_2082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2082 
Symbol 
ID5899537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2231025 
End bp2232248 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content70% 
IMG OID641562571 
Productcapsule polysaccharide export-like protein 
Protein accessionYP_001683708 
Protein GI167646045 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3524] Capsule polysaccharide export protein 
TIGRFAM ID[TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGCT ACGCTCCAAC GATCACCCCC GCGGACGATC CGGCCACCCC GGCGGCGGAA 
CCCCAGGTCC GCGCCGCGCG CCGCCGGCGG TTCGCCCATG CGGGTCCGCC CCCCGCCCCG
GCCAGGCCGA AATGGCAAGA CGGTCTGCGC GCCGCGCGAC CCTTCCTGCT GATCGTGATC
CTGCCGACCC TGCTGGTCGC CGGGTTTCAG TACCTGATCG CCGCGAACCA GTATCAATCG
GAGGCGCACT TCATCGTCCG CAGCGCTCAG CCGAGCGGCG GGACGGGCGG TCTAGGCCAG
ATGCTCGGCC TCACCGGCGC CCCCTCTCCG GCCGAGGCGC ACAGCATCGG CGACTATCTG
CTGTCCCACG ACGCCGTGGC GGCGGTGCAG TGGACGATGG ACCTACCGGC GATCTTCCGG
CGACCGGAGG CGGACCCGAT CACCCGCCTG GACCCGGCGC ATCCCCCGCC CGAGACGCTG
CTGAAATATT ACCGGCGCCA GGTGAGCGTG AGCCTTAATA CGGAAACTGG CATCACCGAC
CTGAGCGTGC GGGCGTTCCG TCCGGCCGAC GCCCGCGACC TGGCCGAGAC CCTCCTGCGG
CTGGGCGAAA CCCGGGTCAA CGCCTTCAAC CAGCGCGCGC TCGCCAACGG CCTTTCGGTG
GCCGACGCCC AATTGCGCGA GGCGGAACGT GGCGTCACCG ACGCCCAGAG GAACCTTACG
GGCTTCCGCC AAGGCGGTCG CGACATCGAC CCTGAACGGA CCAGCACGGC CCAGATCACC
CTGGCCGCCA ACCTACAGCA ACAACTGGCC CAGGCGCGGG CCCAACGCGA CAGCATGGCC
GGCTCGGTCG CGCCGGACAG CCCGCAATAC GTCGCCATCG CCAGGCAGAT CCGAGCCCTG
GAAGTCCAGG CCGCCGCCGC CCAGGGACGC TTGGCCGGTT CCGCGGCGTC CATCGCGCCG
GGGCTCGGCG CCTATGAAAG CCTGCGGTTG CGCCAGGAGT TCGCGGCCAA GCGTTACGAG
GCCGCCGCCG CCTCGCTGGA GGCCGCGCGG GAACAAGCCT TGAAGCAGCA ATTGTTCGTC
ATCCGGGTGG TCGAGCCGAA CCTGCCGAGC AAGGCGCTCT ACCCTCATCG CCTGAAGACC
GTCGCCATCG TCTTCTTCGG TCTGCTGCTG TCCTACGCCG TGGGCTGGCT GATCCTGGCG
GGCGTGCGCG AACACGCCGG ATGA
 
Protein sequence
MDGYAPTITP ADDPATPAAE PQVRAARRRR FAHAGPPPAP ARPKWQDGLR AARPFLLIVI 
LPTLLVAGFQ YLIAANQYQS EAHFIVRSAQ PSGGTGGLGQ MLGLTGAPSP AEAHSIGDYL
LSHDAVAAVQ WTMDLPAIFR RPEADPITRL DPAHPPPETL LKYYRRQVSV SLNTETGITD
LSVRAFRPAD ARDLAETLLR LGETRVNAFN QRALANGLSV ADAQLREAER GVTDAQRNLT
GFRQGGRDID PERTSTAQIT LAANLQQQLA QARAQRDSMA GSVAPDSPQY VAIARQIRAL
EVQAAAAQGR LAGSAASIAP GLGAYESLRL RQEFAAKRYE AAAASLEAAR EQALKQQLFV
IRVVEPNLPS KALYPHRLKT VAIVFFGLLL SYAVGWLILA GVREHAG