Gene Caul_4540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4540 
Symbol 
ID5902001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4913767 
End bp4914807 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content69% 
IMG OID641565059 
Productcation diffusion facilitator family transporter 
Protein accessionYP_001686158 
Protein GI167648495 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1230] Co/Zn/Cd efflux system component 
TIGRFAM ID[TIGR01297] cation diffusion facilitator family transporter 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.115451 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAACG ATCACGATCA CAAGCACGAT CATTCCGACC ATGATCACCA CGATCATGGC 
CATGACGACC ACCATGATCA CGATCATGGC CACGGCCACT CGCATGACCA CGGCCACCAT
CACCACGGCC CCGGTGGGCA CAGTCATGCG CCCAAGGATT TCGGCCGCGC CTTCGCGGTC
GGCGCGACGC TGAACATCGG CTTCGTCATC GCCGAGACCG TCGCCGGCCT GATGACCCAT
TCGCTGGCCC TGCTGGCCGA CGCCGGCCAC AACCTGTCGG ACGTGCTGGG GTTGTTCATG
GCCTGGGGCG CGGTGGTCCT GGCCAAGCGG GCGCCGGCCG GCCGCCATAC CTATGGCCTG
CGCAAGGGCA CGATCCTGGC CTCACTGACC AACGCCGTCT TCCTGCTGGT CGCGGTCGGC
GCCATCGCCT GGGAGGCCGC CCGCCGCTTC GCCGATCCGC ACCCGATCGA CACCGGGCCG
GTGATGATCG TCGCGGCGAT CGGCATCGTC ATCAACACCG CCACGGCCCT GATGTTCATG
CGCGGCTCGA AGGACGACCT CAATATCCGC GGCGCCTTCC TGCACATGGC CGCCGACGCC
GCCGTCTCGG CGGGGGTCGT GGTCGCCGCC CTGGTCATGT GGCGCACCGG TTGGTTGTGG
CTGGACCCGG TGGTCAGCCT GGGCATCGTG CTGGTGATCG TGCTGGGCAC CTGGAGCCTG
CTGCGCGACA GCCTGGACCT GGCCCTGGAC GCCGCCCCGC GCGGCATCGA CCCCATGGCG
GTAAAGGACT GGCTGACCGC TCGCCCCGGC GTCAGCGAGG TCCACGACCT GCACATCTGG
GCGATGAGCA CCACCGAAAC GGCCATGACC GCCCACCTGG TGCGGCCCCT CGGGGCCGGC
ATGGTCGGCG AGGACCTCGA CGCCTTCCTG CACGACGCCT GCGCCGAACT GAACAGTCGG
TTCAAGATCG GCCACGTCAC CCTGCAGGTC GAGCACAGCG GCGCGGCGTC CTGCCGGCTG
GCCCCCGCGG ACGTGGTGTG A
 
Protein sequence
MPNDHDHKHD HSDHDHHDHG HDDHHDHDHG HGHSHDHGHH HHGPGGHSHA PKDFGRAFAV 
GATLNIGFVI AETVAGLMTH SLALLADAGH NLSDVLGLFM AWGAVVLAKR APAGRHTYGL
RKGTILASLT NAVFLLVAVG AIAWEAARRF ADPHPIDTGP VMIVAAIGIV INTATALMFM
RGSKDDLNIR GAFLHMAADA AVSAGVVVAA LVMWRTGWLW LDPVVSLGIV LVIVLGTWSL
LRDSLDLALD AAPRGIDPMA VKDWLTARPG VSEVHDLHIW AMSTTETAMT AHLVRPLGAG
MVGEDLDAFL HDACAELNSR FKIGHVTLQV EHSGAASCRL APADVV