Gene Caul_2958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2958 
Symbol 
ID5900413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3212569 
End bp3213717 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content72% 
IMG OID641563455 
Productnitrate transporter 
Protein accessionYP_001684583 
Protein GI167646920 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.899389 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGGC TGGCCGACCT GACCCTGGGC TTCATCCCGC TGACCGATTG CGCGCCGCTG 
GTCGTGGCCA AGGCCCAGGG CTTCTTCGCC GAGGAGGGGC TGGAGGTCGC GCTGAGCCGC
GAGGCCTCGT GGGCGACGAT CCGCGACAAG GTCGCCGTGG GCGCGCTGGA CGGCGCCCAC
ATGCTGGCGC CGATGGCTCT GGCCGCCGGC CTGGGGGAAG GCCTGGCCGC CGCGCCGATG
CTGGCCCCCT TGGCGCTGAA CCAGAACGGC AGCGCGATCA CCGTCTCCAC CAGGCTGGCG
GGGAAGCTGC GTGAGATCGA TCCCGAGGCC ATGGCCACGC CGCTGACCAC CGCCCAGGCC
CTGGCCCGGT TCGTGGAGCG GCGTCGAGAC CAGGGCGCGC CGCTGCTGAC CTTCGCGGTC
GTCTTCCCGC AGTCGATGCA CAACTACGCC TTGCGCTATT GGTTGGCGCA GGCGGGAATC
GACCCTGACC GGGATGTGCG CCTGGTGGTC ACGCCGCCAC CTCGGATGGT CGAGCACCTG
CGTTCCGGCG ACATCGACGG CTTCTGTGTG GGCGCGCCCT GGAACGCCGT CGCCATGGAC
GAGGGCCTGG GCGAGGTGCT GATCAAGGCC TCGCAGTTCT GGCCCGGCGG CCCGGACAAG
GTGTTCGGCC TCACCGCCGT CTGGGCCGAG CGGCATCCTG ACGAACTTCG GGCGGCCTTG
CGCGCCTTGA TCCGGGCTTC GGCCTGGACC GACGAGGCGG GCAACCATGC GGAGCTTGTG
GCCCTGCTGT CACGGGCCGA CCACGTCGGC GTCGAGCCCG AAGCCCTGGC CCGCGCGCTG
AAAACGGAGA TCGTCTTCCA TCGCGACGCC GCCGGCCTGC CGCGCCGCGA GCACGCCCTG
TGGTTCCTGT CGCAGATGGT TCGCTGGGGG CAGGTGGGGC GGGACGTCGA TCTCGACGCC
GTCGCCGACC GCGTCTATCG CCCGGACCTG TTTCGCGAGG CGGCGCTGTC GCTGGGGCCG
GTGTTCGAGC CAGCCATGGT GTTCGCCGAC GCCGCGCCCG CGCCGTCAGC CTTGTTCGAT
GGCAAGCCGT TCGATCCGGC GGACGCGCGG GGCTATGCGG CGTCGTTCGC GATCGGGCGC
GGTTCCTGA
 
Protein sequence
MSGLADLTLG FIPLTDCAPL VVAKAQGFFA EEGLEVALSR EASWATIRDK VAVGALDGAH 
MLAPMALAAG LGEGLAAAPM LAPLALNQNG SAITVSTRLA GKLREIDPEA MATPLTTAQA
LARFVERRRD QGAPLLTFAV VFPQSMHNYA LRYWLAQAGI DPDRDVRLVV TPPPRMVEHL
RSGDIDGFCV GAPWNAVAMD EGLGEVLIKA SQFWPGGPDK VFGLTAVWAE RHPDELRAAL
RALIRASAWT DEAGNHAELV ALLSRADHVG VEPEALARAL KTEIVFHRDA AGLPRREHAL
WFLSQMVRWG QVGRDVDLDA VADRVYRPDL FREAALSLGP VFEPAMVFAD AAPAPSALFD
GKPFDPADAR GYAASFAIGR GS