Gene Caul_4655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4655 
Symbol 
ID5902117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5031669 
End bp5032796 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content67% 
IMG OID641565174 
Producthemolysin-type calcium-binding region 
Protein accessionYP_001686273 
Protein GI167648610 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCTA CATTCCTGAC ACTGACCGAG AACGAGAGCT ATGTGATCGG TGCGCCCGAC 
ACGAAGTTTC CCTTCGTGGT CGTCGGTGTG GGCGGCGGCG GCGTTGTCAC CACGCCCAAT
CCCAACCCCG AACCTCGTGA GGCCTTCTTC AAGTACGAGT ATCAGGCGGG CGGCAAGGGC
TTTGGCCACT TCGTGGTCAA CAACCCCTAT TTCGGCGAAT ATTCGCCGGT CTTCGTGACC
ATCCTCGGCG CGGACGGCTC GGCGCCGGTC CAGCATGCGC CGACAGCCCA GGGCGAAACA
CTGACGCTGG CGAACGCCAA CAGCGGCTTC AACCTGTCCC GGTTGCTGGC CAACGATGTC
GATCCGGACG GCGACCTACT GTACGTCCAC ATCGTCTCGC CGTTCTCGTT CACCGCGCCG
GCGGGCTCGA CCACGTCCGC CGAGGTCTTC AGTCATTCAC CGGATCTGCC ATTCAACACG
GTGTTTCCGC TCGACGGCAG CCAGCTGTCC ATCGCCGCTG ACAAGCCCGA CGGCACGCCC
CTGGGCTATA CCGAACTGCG GTTCGACTAT TTCGTCAGCG ACGCCTATGG CAACGCCTCC
AATACGGTCC AGGCCGTGAT CAAGATCGGC GCGCCGCCGG CGGGCGCCTA TGTCGCGGGC
GGGGCCGGTG ACGACACCAT CGACAAGAGC GGCACGACCG TCGCCTGGCA ATTGGCCGGC
GGGGGCGGCG ACGACTATCT GTGCGGCGGC TCGGGCAATG ACAGCCTGAA CGGCGGGGCG
GGCGACGATC GGCTGATCGG CGGCGCGGGC AACGACGTCC TCACGGGCGG GACGGGCGCC
GACCGCATGT TCGGCGGCGC GGGCAATGAC ACGTTCCTGA TCCGGGCGGG AGACCTGGCG
ACCGGGCCGG TCAAGGACCA GATCATCGAC TTCGAGGGCG CCGGCGTGAC CGGTGGCGAC
ATGCTGCGGC TCGTCGGCTT CGGCGCCGGC GCCACCCTGG TGCATCTGGG CGAGGTCGGC
GCGGTTTCCC ACTACATGAT CAACGACGGC GCGCACTCCG GCGAGTTGTG GGTGCAGGCG
GGCGGCGTCC TGCTGCAACC GGGCGACTAC GGCTTCGTCT CCGCCTAG
 
Protein sequence
MAATFLTLTE NESYVIGAPD TKFPFVVVGV GGGGVVTTPN PNPEPREAFF KYEYQAGGKG 
FGHFVVNNPY FGEYSPVFVT ILGADGSAPV QHAPTAQGET LTLANANSGF NLSRLLANDV
DPDGDLLYVH IVSPFSFTAP AGSTTSAEVF SHSPDLPFNT VFPLDGSQLS IAADKPDGTP
LGYTELRFDY FVSDAYGNAS NTVQAVIKIG APPAGAYVAG GAGDDTIDKS GTTVAWQLAG
GGGDDYLCGG SGNDSLNGGA GDDRLIGGAG NDVLTGGTGA DRMFGGAGND TFLIRAGDLA
TGPVKDQIID FEGAGVTGGD MLRLVGFGAG ATLVHLGEVG AVSHYMINDG AHSGELWVQA
GGVLLQPGDY GFVSA