Gene Caul_0583 details

Gene Information

Locus tag: Caul_0583
Symbol:
ID: 5898038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 637112
End bp: 638512
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 63%
IMG OID: 641561065
Product: citrate transporter
Protein accession: YP_001682214
Protein GI: 167644551
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
COG ID: [COG2610] H+/gluconate symporter and related permeases
TIGRFAM ID:
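
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(637112, 638512)
print(length)                     # 1401
print(protein_length_aa(length))  # 466
```

Both values match the Gene Length and Protein Length fields of the record.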


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.242191
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATCGC TGATGGTCGC TCTGGGCGCG CTCATCTTCC TCATGCTTGC CGCATACCGC 
GGCTTCAGCG TCATCATTGC GGCGCCTCTG GCCGCCATCG CCGCGATTCT GCTGACAGAG
CCTTCGGCCG TTCCGGCGGT TTTCAGCGGC CTGTTCATGG ACAAGATGGT CGGCTTTCTG
AAGCTCTATC TGCCTGTCTT TCTGCTTGGC GCGATCTTCG GCAAACTTGT CGAAATCTCC
GGGTTCGCTC GGTCGATCGT CTCCGGCTTG GTGACCCTTG TTGGCGCCGA CCGCGCCATG
CTCGCCGTGG TGCTGGTGTC CACCCTCATG ACCTACGGCG GCGTGTCGCT CTTTGTGGCC
GTGTTCGCGG TCTATCCATT CGCGGCCGAG ATGTTTCGGC GCGCCGACAT CCCCAAGCGT
CTCATCCCCG CCACGATCGG ACTAGGCGCG TTCAGCTTCA CGATGGACAG CCTGCCGGGC
AGTCCGCAGA TCCAGAACAT CATCCCGACC ACCTTTTTCG GAACGACCGC CTGGGCGGCG
CCGATCCTTG GTCTCGTTGG CGCGCTTTTC ACCTTCGCCA GTGGAATGGC CTACCTCGAG
TGGGCCCGCA AGCGCGCAAA AAGCCGTGAC GAGGGCTATG GCGAGGGCCA CAGCAATGAG
CCGGCTCCGG CCCCCGAGGC GTTGACGGCG GCAGCGCATC CCCTAGTCGC CCTGATCCCA
CTCGCACTGG TCGGGCTGAG CAATTTGGCT TTAACCCACG CTATCCCGCT GCTCTACCCG
CCGACCCTGA CGACCTCGCT TCCTGGACTG CCAGCGCCTT TGACCACCAA CCTCCAGGCG
GTCACGGCGA TCTGGGCCGT CGAGGGCGCG CTGGCCGTCG GAATCCTCTC CCTGTTCCTG
TTATCGTTCA GGACGGTTGT TCGCGAGGTT GGGGTCGGGA CCAAAGCCGC GGTTGCTGGC
GCCCTGCTGG CGGCTGTGAA CACGGCCTCG GAATACGGCT TCGGTGCGGT GATCGCGGCT
CTGCCAGGTT TCCTCGTCGT GCGTGACGCG CTTAGCGCCG TTCCCAACCC CCTGATCAAC
GAGGCCGTGT CCATCACCGC CTTGGCGGGA ATTACGGGCT CCGCATCGGG CGGCATGAGC
ATCGCGCTTG CCGCTCTGTC CGAGCAATTT ATCGCTGCGG CCCATGGCGC CGGAATACCG
CTGGAGGTCT TGCATCGCGT GGCCTCCATG GCGAGCGGCG GGATGGACAC GTTACCGCAC
AACGGCGCGG TGATCACAGT ACTCGCTGTG ACTGGACTCA CACATAAACA GTCGTACGGG
CCAATATTCG CTATAACCGC TATTAAGACC GCAGCCGTGT TCGTCGTTAT TGCGACGTAC
TACTTGACTG GGATCGTATA G
 
Protein sequence
MTSLMVALGA LIFLMLAAYR GFSVIIAAPL AAIAAILLTE PSAVPAVFSG LFMDKMVGFL 
KLYLPVFLLG AIFGKLVEIS GFARSIVSGL VTLVGADRAM LAVVLVSTLM TYGGVSLFVA
VFAVYPFAAE MFRRADIPKR LIPATIGLGA FSFTMDSLPG SPQIQNIIPT TFFGTTAWAA
PILGLVGALF TFASGMAYLE WARKRAKSRD EGYGEGHSNE PAPAPEALTA AAHPLVALIP
LALVGLSNLA LTHAIPLLYP PTLTTSLPGL PAPLTTNLQA VTAIWAVEGA LAVGILSLFL
LSFRTVVREV GVGTKAAVAG ALLAAVNTAS EYGFGAVIAA LPGFLVVRDA LSAVPNPLIN
EAVSITALAG ITGSASGGMS IALAALSEQF IAAAHGAGIP LEVLHRVASM ASGGMDTLPH
NGAVITVLAV TGLTHKQSYG PIFAITAIKT AAVFVVIATY YLTGIV
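
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard code, whose codon-to-amino-acid assignments translation table 11 shares; table 11's alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
# Standard codon table built in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGACATCGC TGATGGTCGC TCTGGGCGCG"))  # MTSLMVALGA
# Final 21 bases, ending in the TAG stop codon.
print(translate("TACTTGACTG GGATCGTATA G"))           # YLTGIV
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.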