Gene Caul_2521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2521 
Symbol 
ID5899976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2735008 
End bp2736126 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content71% 
IMG OID641563012 
Productaminodeoxychorismate lyase 
Protein accessionYP_001684146 
Protein GI167646483 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCGCC CGCGCCCTCC CGCCAATAAA GGGGCGCCGT CCAGGGCGCC GTCGCCCCGC 
AAGGCGCCTC GCAAGCCGAG CCCACGCCGG CGGGAGTCCG CGCCCATCGG TCGCCGCCTG
GTCGCCATGG GCGGCGGCCT GCTGACCATG CTGGGCGTGG CCGCCCTGGC GGTCGTGCTG
GGGGCGGTGT GGCTCTACCA GGGGCCCGGC CCGGCGGCGC GTTCGGGCGA GGTCACCACC
GTCGTCCTGC GTCGCGGCGC CAGCCTGCCC GAGATCGCCT CGACCCTGGA GCAGGCCGGA
GTGATCCGCT CGTCCTCGAT CTTCCTGACC GCCGCCCAGA CCACCGGCGC GGCGCGGCGG
CTGAAGGCCG GCGAATATGA GTTCCCGTCG CGCGCTTCGC TGCGCCAGGT TCTGGGCAAG
ATCCGCGACG GCAAGATCGT GCGCCACCAC GTGACGATCG CCGAGGGCCT GACCTCGGAC
ATGGTGGTCG ATATTCTGAT GCGCGCGCCT GAGTTGACCG GCACCGTGCC GACCCCGCCG
GAAGGCTCGA TCCTGCCCGA GACCTATCAG GTCCAGCGCG GCGAGGACCG CGCGGCGGTG
CTGCAGCGGA TGATGGACGA CCGCGACGCC CTGCTGGACA AGCTGTGGGC GCAGCGCCAG
CCGGGCCTGC CGTTCGAGAC CAAGGATCAG GCCGTGACCA TGGCCTCGAT CGTCGAGAAG
GAAACCGGCC TGGCCGCCGA GCGTCCGCAT GTGGCGGCGG TGTTCATCAA CCGCCTGCGC
CAGGGGATCC GCCTGGGCAG CGACCCGACC ATCATCTACG GCCTGACCCG CGGTCGGCCG
CTGGGCCGCG GCATCCTGCA GTCGGAACTG CAGCGCCAGA CGCCCTACAA CACCTATCTG
ATCGAGGGTC TTCCACCGAC CCCGATCGCC AATCCCGGCA AGGCCGCTCT GGAAGCCGTG
CTCAATCCGA TGAAGAGCAA TGACCTCTAC TTCGTCGCCG ACGGCACGGG CGGCCACGTC
TTCGCCTCGA CCTATGCGGA GCACGAGCGC AATGTCGCCA GGTGGCGGCA GGTCGAGCGC
TCGAAAGCGG TCGCGAAGAT CCCGTTGGCC GGAGGCTAG
 
Protein sequence
MSRPRPPANK GAPSRAPSPR KAPRKPSPRR RESAPIGRRL VAMGGGLLTM LGVAALAVVL 
GAVWLYQGPG PAARSGEVTT VVLRRGASLP EIASTLEQAG VIRSSSIFLT AAQTTGAARR
LKAGEYEFPS RASLRQVLGK IRDGKIVRHH VTIAEGLTSD MVVDILMRAP ELTGTVPTPP
EGSILPETYQ VQRGEDRAAV LQRMMDDRDA LLDKLWAQRQ PGLPFETKDQ AVTMASIVEK
ETGLAAERPH VAAVFINRLR QGIRLGSDPT IIYGLTRGRP LGRGILQSEL QRQTPYNTYL
IEGLPPTPIA NPGKAALEAV LNPMKSNDLY FVADGTGGHV FASTYAEHER NVARWRQVER
SKAVAKIPLA GG