Gene Caul_2571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2571 
Symbol 
ID5900026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2791427 
End bp2792848 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content66% 
IMG OID641563062 
Productpeptidase M48 Ste24p 
Protein accessionYP_001684196 
Protein GI167646533 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCCC GCAGTTGGGC GGCTTCCGGA AAGTTGGGCG CAAAGACCCT GTCGGCGCTG 
GCCTTGGTCG CGACGATGCT GGCGCCCGTC GCCCCGGCGC GCGCTCAGGA CGGCCCCTCG
CTGATCCGCG ACACCGAGAT CGAGGAGATC CTGCACCGCG ACGCCGACCC CATCTACGCG
GCGGCTGGGT TGGATCCCAA AACCGTCCGC ATCCTGCTGG TCGGCGACAA GGAGTTGAAC
GCCTTCGCCA CCCAGGGGTT GATGATGGGT CTCAACACCG GCCTGATCCT TCAGACCGAG
ACGCCCAACC AACTGAAGGG CGTCATCGCC CACGAAACCG GGCATTTGGC CGGCGCGCAC
CCGCTGCGTT CCGACGAGCT GATGAAGGCC GGCCTCAAGC CGATGATCCT GACCATGGGC
CTGGGCGTGC TCGCCGCCCT GGCCGGAGCT CCGGACGCCG GCGCCGTGCT GTTGGGCAAC
GCCTCCTATT TCGGGACCCT GGGCGCGCTG GGCTACAGCC GCGACCAGGA ATCGCGGGCC
GACCAGGCCG GCGCCGGTTT CCTGGAGGCC ACCGGCCAGT CGGGCCGCGG CCTGGTCGAG
TTCTTCGACA ACTTCCGCTA TCAGGAGGTC TTCGACCAGT CGCGGCGCTT CGCCTATTTC
CGCAGCCACC CGCTGTCGGG CGACCGGATC GACGCCCTGC GCAGCCGCGT CGAGAAACTG
CCCCACTATA ACAGCGTCGA CGACCCCACC TCGCTAGCCG AGCACGAGAT CATGAAGGCC
AAGCTGGAGG GCTTCATCAA TCCCGGCGTG GCGATCGTGA AATACAAGGA AGCCGACAGG
GGATTCCCGG CCCGTTACGC CCGCGCCATC GCCTATTACC AGCTCAAGGA ACCCGATCGG
GCTCTCAAGA TCCTCGATGG TCTGATCGCC GAGAACGCAG ACAACCCCTA TCTCTGGGAG
CTCAAGGGGC AGATTCTGTT CGAGTTCAAT CGCGTCAAGG AAGCCGAGGA GCCGCAGCGT
CGCTCTGTGG CCCTCAAGCC CGATGCGGCC CTGCTGCGGG TCAATCTGGG CCAGACCCTG
ATCGGCCAGG ACGATCCCAA GAAGGTCGAG GAAGGCATCA GCGAGCTGAA GCGCTCGCTG
ATCGACGAAA GCGACAATTC CGTCGCCTGG CGCCTGCTAG CCCAGGCCTA TGACAAGCGC
GGCGAGGATG GTCAGGCCCG CCTGGCCACC GCCGAGCAAT ATTTCAACAT GGGCGCCGCC
CAGGAGGCTC GCGTATTCGC CATGCGAGCC CGCGAGTTGC TCAAGAAGGA CAGCGTCGAA
TGGCGCCGCG CCACCGACAT CGTCCTGACT TCCAATCCTT CCAACCAGGA CCTCAAGGAC
CTGGCCAAGG AAGGCGCCGT CACCTCGGGC CTGCGCCGCT AG
 
Protein sequence
MTSRSWAASG KLGAKTLSAL ALVATMLAPV APARAQDGPS LIRDTEIEEI LHRDADPIYA 
AAGLDPKTVR ILLVGDKELN AFATQGLMMG LNTGLILQTE TPNQLKGVIA HETGHLAGAH
PLRSDELMKA GLKPMILTMG LGVLAALAGA PDAGAVLLGN ASYFGTLGAL GYSRDQESRA
DQAGAGFLEA TGQSGRGLVE FFDNFRYQEV FDQSRRFAYF RSHPLSGDRI DALRSRVEKL
PHYNSVDDPT SLAEHEIMKA KLEGFINPGV AIVKYKEADR GFPARYARAI AYYQLKEPDR
ALKILDGLIA ENADNPYLWE LKGQILFEFN RVKEAEEPQR RSVALKPDAA LLRVNLGQTL
IGQDDPKKVE EGISELKRSL IDESDNSVAW RLLAQAYDKR GEDGQARLAT AEQYFNMGAA
QEARVFAMRA RELLKKDSVE WRRATDIVLT SNPSNQDLKD LAKEGAVTSG LRR