Gene Caul_4267 details

Gene Information

Locus tag: Caul_4267
Symbol:
ID: 5901728
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4639121
End bp: 4640410
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 75%
IMG OID: 641564786
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001685886
Protein GI: 167648223
COG category:
COG ID:
TIGRFAM ID:
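
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 4639121
end_bp = 4640410

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1290, matching "Gene Length"
print(remainder)       # 0, the length is a whole number of codons
print(protein_length)  # 429, matching "Protein Length"
```

Under these assumptions the reported gene length (1290 bp) and protein length (429 aa) agree with the coordinates.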


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.447116
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCGT TGAACCCTCG CCCCGGCGCC ATGATCGCGA TCCTGTCGTG CCTGCTGGCG 
CTCGGCGCTG GGCCGCCGTC GGCCTCGGCT CAGCTATTGC CGTCGCCATC CTCGGCGGTG
GGCCGAACGG GTCTGCCGCT GCCGCGGGTC CCGCTGGTCG ACCGGCTGGC GGACGACGCG
GGCCAGGTCG CGGAGCGGCT GACGCCCGAA GGGTTGGCGG CCGCGCGGCT GGACCGCCTC
GCCGCTGTGG TCCGCGCCCA TCCCCGCGCG CTGGAGAGGG ATGACCTGGG CCAGCCAGTG
GTGCGCGGCG AGGTGCTGGC CGTGTCGCCC ACTCCAGAAG CCCTGGCCAG GGCCGTCGCG
GTCGGCTTCG TCGTCGCCCG CCAGACGCGG TCGGAGGCGC TGGGGCTGGA GCTGGTCACC
CTCACGCCGC CCAAGGGGAT GAGCGTTCGG GCGGCGGTGC GGCGGCTGCG GGGGCTGGAT
CCGACCGGCG ACTACGACTT CAACCACCTC TATGCCGACG CCGGTCGGGG CGGCCCGCCG
GTCGTTGTCA ACGGCGGCGC GTCCGCAACG GACGGCCACG GGCGTGCGGG GCTGCTGGAC
GGCGGCGTCG CCGACGACCA GCCGGCCTTC GCGCGGGTCA AGGTCCAGCA GCGCGGGTTC
GCCAAAGGCG GCCCGACCCC CAGCGCCCAC GGTACGGCCA CCGCTTCGCT GATCGCCGTC
AGCGGGGTCG ATGAGCTTCT GGTGGCCGAC GTCTACGGCC AGGGACCGAC TGGCGGCTCG
GCCGAGGCCA TCGTCGGAGC CCTGTCCTGG ATGGCCCAGG CGCGCGCGCC GGTGATCAAT
ATCAGCCTGG TGGGGCCGCC GAACGGGGCG CTGGGCGCGG CGATCCGGGC GCTGGTCGCC
AAGGGCTGCC TGATCGTGGC GGCGGTCGGC AACGACGGCC CCGCCGCCCC GGCCCTCTAT
CCAGCCTCCT ATCCTGGGGT GATCGCGGTG ACGGGGGTGG ATCGCCGCCA CCAGCTGTTG
CCCGAGGCCA GCCGCGCCGG TCATGTCGAT TTCGCCGCCT ACGGGGCCGG GGTGCGGGTC
GCCGCGCCCG GTGGGCGCAC GATGACGGTG CGCGGCACCT CCTACGCCGC GCCGGTGGTG
GCCGGCCGGC TGGCGCGGTT GCTGGACGCG CCCGATCCGG CGGCGGCCAA GCGGGCGTTG
GCCCAGCTTG CCGCCGGGGC CATCGACCTG GGCGCGCCAG GCGTGGATCC GGTCTTCGGC
AAGGGTCTGG TCGAGGCGGG GCCAAAATAA
 
Protein sequence
MTSLNPRPGA MIAILSCLLA LGAGPPSASA QLLPSPSSAV GRTGLPLPRV PLVDRLADDA 
GQVAERLTPE GLAAARLDRL AAVVRAHPRA LERDDLGQPV VRGEVLAVSP TPEALARAVA
VGFVVARQTR SEALGLELVT LTPPKGMSVR AAVRRLRGLD PTGDYDFNHL YADAGRGGPP
VVVNGGASAT DGHGRAGLLD GGVADDQPAF ARVKVQQRGF AKGGPTPSAH GTATASLIAV
SGVDELLVAD VYGQGPTGGS AEAIVGALSW MAQARAPVIN ISLVGPPNGA LGAAIRALVA
KGCLIVAAVG NDGPAAPALY PASYPGVIAV TGVDRRHQLL PEASRAGHVD FAAYGAGVRV
AAPGGRTMTV RGTSYAAPVV AGRLARLLDA PDPAAAKRAL AQLAAGAIDL GAPGVDPVFG
KGLVEAGPK
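
The sequence blocks above are printed in space-separated groups of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate a nucleotide fragment. It is an illustration only: the helper names (clean, gc_content, translate) are hypothetical and not part of the source database. The codon table used is the standard one, whose amino-acid assignments are the same as in translation table 11 (table 11 differs in which codons may act as start codons).

```python
from itertools import product

# Standard codon table in TCAG order; amino-acid assignments are shared
# with translation table 11 ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at a stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last display rows of the gene sequence, copied from above.
first_row = clean("ATGACCTCGT TGAACCCTCG CCCCGGCGCC")
last_row = clean("AAGGGTCTGG TCGAGGCGGG GCCAAAATAA")

print(translate(first_row))  # MTSLNPRPGA
print(translate(last_row))   # KGLVEAGPK
print(gc_content(first_row))
```

The two translated fragments match the start and end of the protein sequence listed above. Applying clean to the full gene sequence and then calling gc_content and translate would allow the reported GC content and protein sequence to be checked in the same way.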