Gene Caul_4246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4246 
Symbol 
ID5901707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4615650 
End bp4617269 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content66% 
IMG OID641564766 
Producthemolysin-type calcium-binding region 
Protein accessionYP_001685866 
Protein GI167648203 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACGT CTATCGCACC CCTGCTCAAC CAGGCCGGCA CATCCTATTC TTGCCTGTGG 
GACGCGGGAG GCGTCGATAC GCTGTCCGCC GAAGGCGCGA GCATCGCCTG CACGATCGAC
CTTCGCGAGG CGAGCCTGCT GAACGAGGCG GGCGGCGGCG GCTGGCTGTC GAGCGGGTCG
GGAATCTATG GCGGCTTCAC GATCGCTCAT GACGCAAAGA TCGAAAACGC CATAGGCGGC
GAGGCCGACG ACATCGTTAC CGGAAACGGC CTCGCCAACA AGATCCTGGG CGGAGGCGGA
AACGACAATC TCTCAGGCGA ACTCGGCGCC GACGTCCTGC TCGGCGGCGA CGGCAGCGAC
AGCCTTGATG GCGGCCTGGG CAATGACACT CTGGACGGGG GCCGCAACTT CGACGTGCTG
ATCGGCGGGG CCGGCGCGGA CGTGCTGATC GGCGGCAGCG GCAATGACCG CTTCCAGAGC
ACATTGGCCG ACCTCAACGG CGACACGATC GCCGATCTCA GCGGCGGCGA TCAAATTCGC
ATCACCGACG CCAACATCTC GACCTTCACC ATGAACCGCC AAGGCGAGAC CGTGACGCTG
TCCGGCGGTG TGACCTTCAC GCTGCAGAAC AACCCCCATG GCACGCTGGT GGCCAGCGCC
GATCCCAGCG GCGGGGTGCG CATCAAGCTG CAACTGCCCG AAACCTCGGC CAAGGACGTC
AACGGTGACG GCCACAGCGA CTTCATCTGG CGCCATAGCA GCGGCTATGT GACCGCCTGG
ATGGTCGGCG GCGACGGGGC CGGCATCGGC TTCAAGGCCA ACACCTACGC CTATGACGTC
TCCAATGACT GGAAGCTGGA AACCACGCTG GACTTCAACG GCGACGGGGC GGCCGACCTG
CTGTGGCGCC ACACGGGCGG CACGTTCACG ATCTGGGCCG GCGCGGGTGA AGGCTTCATG
AGCAACACCT TCGTCAGCAG CGATGTCGGC ACGGACTGGA AGCTGGAGGC GGTCGGCGAC
TTCGACGGCG ACGGGCGTAG CGACCTCATC TGGCGTCACG CCAGCGGGAC CTTCTCGGAG
TGGCGCTCGA CCGGCGCGGA TTTCGAGCGC AACTTTGTGG TCGACAGCAC AGTGTCTCCC
AACTTCAAGG TGGCGGCGGT CGGGGATTTC AACGCCGACG GGGTTGACGA CATCTTCTGG
CGCGACATGA CGCCGGGCAG CGCCACGGCG GGTCAGGCGA TGGTGACAAG CTCGCGCGGC
GACATCTTCG CGCCCCCCAG TCAGCAGGTG ACGGGCGTGG GTCTGGACTG GACGCTGGCC
GGGCATGGCG ATTTCAACGG CGACAACATC GAAGACATCA TCTGGCGCGC CGCCAACGGC
ACCTTCACCG AGTGGCAGGG GACCGGCTCC GGCTTCGTGG CCAACGTCTA TGTCGACGCC
ACCGTCAATC CGGCCTGGAA GCTGGCCGAC GTCGCCGACT ACAACGGCGA CGGCAAGGAC
GACCTCATGT GGCGGCACAC GGGCGGCGCC TTCACCCTGT GGCAATCGAC CGGCGACGGC
TTCCTGGCCA ACGTCCTGGT CAACAGCTCG GTCAACGCCG ACTGGGGCCT GGTGGCGTAG
 
Protein sequence
MTTSIAPLLN QAGTSYSCLW DAGGVDTLSA EGASIACTID LREASLLNEA GGGGWLSSGS 
GIYGGFTIAH DAKIENAIGG EADDIVTGNG LANKILGGGG NDNLSGELGA DVLLGGDGSD
SLDGGLGNDT LDGGRNFDVL IGGAGADVLI GGSGNDRFQS TLADLNGDTI ADLSGGDQIR
ITDANISTFT MNRQGETVTL SGGVTFTLQN NPHGTLVASA DPSGGVRIKL QLPETSAKDV
NGDGHSDFIW RHSSGYVTAW MVGGDGAGIG FKANTYAYDV SNDWKLETTL DFNGDGAADL
LWRHTGGTFT IWAGAGEGFM SNTFVSSDVG TDWKLEAVGD FDGDGRSDLI WRHASGTFSE
WRSTGADFER NFVVDSTVSP NFKVAAVGDF NADGVDDIFW RDMTPGSATA GQAMVTSSRG
DIFAPPSQQV TGVGLDWTLA GHGDFNGDNI EDIIWRAANG TFTEWQGTGS GFVANVYVDA
TVNPAWKLAD VADYNGDGKD DLMWRHTGGA FTLWQSTGDG FLANVLVNSS VNADWGLVA