Gene Caul_4668 details


Gene Information

Locus tag: Caul_4668
Symbol:
ID: 5902130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5046917
End bp: 5048701
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 71%
IMG OID: 641565187
Product: sodium/hydrogen exchanger
Protein accession: YP_001686286
Protein GI: 167648623
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0475] Kef-type K+ transport systems, membrane components
TIGRFAM ID: [TIGR00932] transporter, monovalent cation:proton antiporter-2 (CPA2) family
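
The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Number of residues encoded by a CDS whose last codon is a stop (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length_bp = gene_length(5046917, 5048701)
print(length_bp)                  # 1785
print(protein_length(length_bp))  # 594
```

Both results agree with the Gene Length and Protein Length fields listed in the record.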


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0852287
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAGCACG CGATAACGCC GGCGGACTAC AAGGACCTGG TGCTGTTCCT GGCCACGGCG 
GGCATCGTCG CGCCGCTGTT CAAGCGGCTG AAGCTGAACC CCATCCTGGG CTTTCTCATC
GCCGGCGTGA TCCTGGGGCC GTTCGGCCTG GGCGCGCTCA GCCACCGGCT GCCGTGGCTG
GACTACGTCA CGGTCGACAG CCCCGAGGAA ATCGCCCAAC TGGCCGAGTT CGGGGTGGTG
TTCCTGCTGT TCATGATCGG CCTGGAGCTG TCATGGGAGC GCCTGCGGCT GCTGCGCAAG
CTGGTGTTCG GCCTGGGTGC CCTGCAGATG ATCGGCTGTT CGCTGGCGCT GGGCGCGGTG
GCCTGGCTGC TGGGCCAGAC CCCGGTCGCG GCCCTGACCA TCGGCGCGGC CCTGACCCTG
TCGTCCACCG CCATCGCCGT GCCGGTGCTG GTCGAGCGGA AGCGCCTGCA TTCCGAGGGC
GGACGGGCGA CCTTCTCGGT GCTGCTGTTC CAGGACCTGG CCGTGGCCCC GATCCTGATC
ACCCTGGCGG TGCTGGGGCG GGCCGACGGC GCGTTCCGCC TGACCGACTT GCTGGCCCTG
GGCCCGGCGG CCGTCGGCCT GGGCGTCATC GTGCTGTTTG GCCGCCTGGC GCTGCGGCCG
ATGATGCGCT CGGTCGCGAA AGCCAAGAGC GAAGAAATGT TCATGGCCGC CTGCCTGCTG
GTGATCATCG GGGCGGGCCT GGTGGCCGCC CTGTCGGGTC TGTCGATGGC CCTGGGCGCC
TTCGTGGCCG GGGTGCTGCT GGCCGAAACC GAGTACCGCC ACGAGGTCGA GGTCAAGATC
GAGCCGTTCA AGGGCCTGCT GCTCAGCCTG TTCTTCGTCT CGCTGGGCAT TCGCCTGGAC
CTGTCGCTGC TGGTCGCCTC GCCGGGTCTG GTGCTGGGCG TCGCCGTCGG GCTGCTGGCG
ATCAAGGGCG TGATGATCAC GGGGCTGGGC CGGCTGTTTG GCCTGTCGAA CCGCGCGGCC
ATCGAGGCGG CCCTGACCCT GGCGGCGGGC GGCGAGTTCG CCTTCGTGAT CCTCGACAAC
GCCATGGGCG CCGGCGTGGT TCAGGCCCGG ATCGGCCAGG CGGTGCTGGT GGCCGCCACA
CTGACCATGT TCCTGATCCC GCTGCTGTCG GGGATCGGCG GACGCCTGGC CAAGAAGACC
GCCGCCCCGG TCAGCGAGGC GCCCGATCTG GTGGGCCTGC AGAGCGAGGA GCCGGCGGGC
CGCGTGCTGG TGGTCGGTTA CGGCCGCGTC GGCCGGCTGG TCGGCGACAT GCTCGACCGC
CACGAGCTGC CGTGGATCGC CATCGATCGC GACCCCGGCT TCGTCCAGCA GGGCCGCCGG
GCCGGCCACC GGGTCTACTA CGGCGACGCC TCGCGGGTGG AGCTGCTGGA GCGCTGCGGC
CTGGACCACG CCCGCGCGGT GGTGGTGACC ATGGACTCGC CGGAAGCCGC CGAGGCGGTG
GTGGCCACCG CCCGCGGCCA TCGTCCCGAC CTGACCATCG TCGCCCGGGC CCGCGACGCC
CGCCACGCCG CCCGGCTCTA CGAACTGGGC GCCACCGACG CCGTGCCGGA GACCATCGAG
GCCAGCCTGC AGTTGTCTGA AGCCGTGCTG GTCGACATCG GCGTGCCCAT GGGCCTGGTC
ATCGCCTCGA TCCATGAACG CCGCGACGAG TACCGCAAGG TGCTGAACCG CCCGGACGCC
CTGGGCGGGC GGCGCAAGAG ATTGAGGGAT GCGGGTAGGG TTTAG
 
Protein sequence
MEHAITPADY KDLVLFLATA GIVAPLFKRL KLNPILGFLI AGVILGPFGL GALSHRLPWL 
DYVTVDSPEE IAQLAEFGVV FLLFMIGLEL SWERLRLLRK LVFGLGALQM IGCSLALGAV
AWLLGQTPVA ALTIGAALTL SSTAIAVPVL VERKRLHSEG GRATFSVLLF QDLAVAPILI
TLAVLGRADG AFRLTDLLAL GPAAVGLGVI VLFGRLALRP MMRSVAKAKS EEMFMAACLL
VIIGAGLVAA LSGLSMALGA FVAGVLLAET EYRHEVEVKI EPFKGLLLSL FFVSLGIRLD
LSLLVASPGL VLGVAVGLLA IKGVMITGLG RLFGLSNRAA IEAALTLAAG GEFAFVILDN
AMGAGVVQAR IGQAVLVAAT LTMFLIPLLS GIGGRLAKKT AAPVSEAPDL VGLQSEEPAG
RVLVVGYGRV GRLVGDMLDR HELPWIAIDR DPGFVQQGRR AGHRVYYGDA SRVELLERCG
LDHARAVVVT MDSPEAAEAV VATARGHRPD LTIVARARDA RHAARLYELG ATDAVPETIE
ASLQLSEAVL VDIGVPMGLV IASIHERRDE YRKVLNRPDA LGGRRKRLRD AGRV
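
The protein sequence can be reproduced from the gene sequence using translation table 11, as named in the record. The sketch below is a minimal, dependency-free illustration and not the database's own pipeline. `translate_cds` and `START_CODONS` are hypothetical names, and `START_CODONS` lists only a subset of the initiation codons that table 11 permits. It assumes the input is given in the coding orientation and begins in frame.

```python
# Amino acids for NCBI translation table 11, codons ordered T, C, A, G
# at each of the three positions (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Subset of table 11 start codons (assumption: enough for this illustration).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate an in-frame coding sequence; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    # An alternative start codon such as GTG is read as methionine at position 1.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    return "".join(protein)


# First 30 bases and last line of the gene sequence above.
print(translate_cds("GTGGAGCACG CGATAACGCC GGCGGACTAC"))
# MEHAITPADY
print(translate_cds("CTGGGCGGGC GGCGCAAGAG ATTGAGGGAT GCGGGTAGGG TTTAG"))
# LGGRRKRLRDAGRV*
```

The first call shows why the protein begins with M even though the gene begins with GTG. The second call ends in `*`, the TAG stop codon, which is not part of the 594-residue protein sequence.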