Gene Caul_0485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0485 
Symbol 
ID5897940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp527073 
End bp528389 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content69% 
IMG OID641560968 
ProductCitMHS family citrate/H+ symporter 
Protein accessionYP_001682117 
Protein GI167644454 
COG category[C] Energy production and conversion 
COG ID[COG2851] H+/citrate symporter 
TIGRFAM ID[TIGR00784] citrate transporter, CitMHS family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.224436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTGG CGGTGCTCGG CTTCCTGATG GTGATCAGCT TCATGACGCT GATCATGACC 
AAGCGGCTGT CGGCGGTGAC GGCCCTGATC CTCGTGCCGA TCCTGTTCGG ACTGCTGGCG
GGGGCAGGGC CGCACCTGGG CGGCATGATC GTCAAGGGCG TCACCCAACT GGCGCCCACC
GCCCTGATGC TGGCCTTCGC CGTGCTCTAC TTCGGCGTCA TGATCGACGC GGGGCTGTTC
GATCCACTGG TGCGCAAGGT GCTGGCGACG GTCGGCGACG ATCCGGTCCG GGTCACCGTC
GGCACCGCCT TGCTGGCCAT GGTGGTGTCG CTGGACGGCG ATGGGACCAC GACCGCCCTG
GTGACGATCT CGGCGCTGCT GCCGGTCTAT CGCCGCCTGC GGCTCAACCC CTTGATCCTG
GCCCTGCTGC TGGGTCTCGG CAACACCATC GTCAACCTCA CGCCCTGGGG CGGACCAACG
GCGCGGGCCG CCACGGCCCT GCACCTGGAG CCCAGCGCGG TGTTCCTTCC GCTGGTCCCG
GCCATGGTGA TCGGCCTGGC CGGCGTGCTG GGGCTGGCAT GGTGGCTGGG ACTGGGCGAG
CGCAAACGCC TTGGACGCCT GACGCTGGAT CCGGTCCTGG CGGAGAGTGG GGCCGACCCG
CGCCTCGACC TGGCGGAGGA CCCAACGATC AAGCGGCCCA AGCTGATCGG ATTCAACCTG
GCCCTGACCC TGGTGCTGGT CGCCGGACTG GTGTCAGGCC TCGCGCCCTT GCCGGTGCTG
ATGATGGGGG CCTTCGCGAT CGCCGCGATC GTCAACTATC CCGACATCGC CAGCCAGCGA
AAGCGCGTGG CCGCCCACGC CAACAACGTC CTGGGCGTGG TGCTGCTGAT CTTCGCGGCG
GGGGCCTTCA CCGGCATCCT GTCGGGGACC GGCATGGTCA AGGCGATGGC CGACGCCACC
CTGCTGGCGA TCCCGCCAAG CCTTGGACCC TATCTGGCGC CCATCACCGC CCTGATCAGC
ATGCCCATGA CCTTTTTCAT GTCCAACGAC GCCTTCTATT TCGGCGTCCT GCCGGTGATC
GCCGAGACCG CCGCCAAGTT CGGCGTGCCC GCCGAGGCCA TCGCCCGGGC TTCGCTCATC
GGCCAGCCGG TCCATGGGCT CAGCCCTCTG GTCGCGGCGA TCTATCTGGT CACCGGCCTG
CTCAAGGTCG AGGTCGGCGA GGCCCAGCGA TTCTCGCTGA AATGGGCGAT CGCCGCCTGT
CTGCTCCTGC TGGTCGCCGC CCTGGCGACC GGCGCCTTTC CCCTGCGAGC CGGCTGA
 
Protein sequence
MNLAVLGFLM VISFMTLIMT KRLSAVTALI LVPILFGLLA GAGPHLGGMI VKGVTQLAPT 
ALMLAFAVLY FGVMIDAGLF DPLVRKVLAT VGDDPVRVTV GTALLAMVVS LDGDGTTTAL
VTISALLPVY RRLRLNPLIL ALLLGLGNTI VNLTPWGGPT ARAATALHLE PSAVFLPLVP
AMVIGLAGVL GLAWWLGLGE RKRLGRLTLD PVLAESGADP RLDLAEDPTI KRPKLIGFNL
ALTLVLVAGL VSGLAPLPVL MMGAFAIAAI VNYPDIASQR KRVAAHANNV LGVVLLIFAA
GAFTGILSGT GMVKAMADAT LLAIPPSLGP YLAPITALIS MPMTFFMSND AFYFGVLPVI
AETAAKFGVP AEAIARASLI GQPVHGLSPL VAAIYLVTGL LKVEVGEAQR FSLKWAIAAC
LLLLVAALAT GAFPLRAG