Gene Caul_1919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1919 
Symbol 
ID5899374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2060073 
End bp2061458 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content67% 
IMG OID641562409 
Productmajor facilitator transporter 
Protein accessionYP_001683546 
Protein GI167645883 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCC TTTCCGTCGC AGTGCGTTCC GCGGCTGGCG CCAAGGGCCG TCGCCGGGCG 
CCTGGACTGT TGTCGCTGTC GCTGATGGGC ATACCGCTGA CGGCGCTGTC GCTGCCGCTC
GTCGTCATGA TCCCGGAACA CTACGCCACC GTGCTGGGCC TGCCGCTGGC GGTGATCGGC
TTGATCTTCA CCAGCGTGCG GGTGTTCGAC ATCGTAGTCG ATCCGCTGCT CGGCGCGGCG
ATGGATCGGA CCCGCACCCG CTGGGGCCGC TACCGCCCCT GGCTGATCTT CGGCGCGCCC
GCGCTGATGC TGGCCGTCTA TCTGCTGTTC ATGGCCAAGC CAGGCGTGGG ACCGCTCTAT
CTGCTGGCCA CCCTGACGGC GAGTTTCCTG GGCTGGTCCA TCCTCTCGCT GGCGCAACTG
GCGCTCGCGT CTGGCCTGGC GGCTGGTTAC GACGAGCGGT CTCGGGTGTA CGCTTGGCTG
CAGTTCGCCT CGCTCCTGGG CATTCTGACG GTGATGGGCT TTCCGATCCT CTCGGCCAAG
CTCGGCGAGA CCAGCCTGGC TCCAACCCAG CTGATGGGAT GGATCATCAT CGTCCTGATG
GCGCCCGCGG TGGCCTTGGC CGCCTGGCGC GTGCCTGAAG CCACGGCGGC GGCGCAAAAA
CATATTGTCG GGATCAAGGA ATATCTGTCC GTCGTCGGCC GCCAGGCCGT CTTCCGGATC
GCCGCGATCG ACCTGCTGTT CGGCCTCGGA TTCGGGACTG CTTCGGCCAT GCTGGTGTTC
TTCGTGACGG CCGCCAAGGG GCTGGATCGC AGCGCGGTCG GCGTCGTCTT GATCGCTCAA
GTCGTCACCG CCATGATCAC CGTACCCGCC GTGGCCTGGC TGGCGCGCCG GCTCGACAAG
CATTTCGTGC TGGGAATTAC GGGCCTTCTC GCGGCGCTGG TCAGCGTGGC CTTCATCTTC
CTTCCCGACC ACAACCTGTT GGCGGTGTCG CTGGGCATGA TGGGCTGGGG CCTGTCGTTT
GGGGCCTTCA ACCTGCTGCC GCGCGCAATG ATGGCGGATG CGGGCGACGA GCTGAGGCTG
GAGTCGGGTT CCGACCAGAC CGGCGTCCTC TACGCTCTGT TGATCAGCAG CTGGAAGCTG
GGCGGAGCCT TGGCGGTCGG CCTGTCGTTC GCGGCCCTGG CTCTGGTCGG CTACAAGCCG
GCGCTTATGG GCGCTAACAC GCCGCAGGCC ATTTCGGGCC TGGAAATGGT GTTCGCGGGC
CCTTCGGCGG CGCTGTTCCT GCTCGGCGCC TGGCTGGCCT TCACCTATCC GCTGACTCGT
GAAAAGCACG CCGCCATCCG GCTTGAGCTG GACGCCCGTG ACGCCCGCGG CGAGGTCCCT
GCATGA
 
Protein sequence
MTTLSVAVRS AAGAKGRRRA PGLLSLSLMG IPLTALSLPL VVMIPEHYAT VLGLPLAVIG 
LIFTSVRVFD IVVDPLLGAA MDRTRTRWGR YRPWLIFGAP ALMLAVYLLF MAKPGVGPLY
LLATLTASFL GWSILSLAQL ALASGLAAGY DERSRVYAWL QFASLLGILT VMGFPILSAK
LGETSLAPTQ LMGWIIIVLM APAVALAAWR VPEATAAAQK HIVGIKEYLS VVGRQAVFRI
AAIDLLFGLG FGTASAMLVF FVTAAKGLDR SAVGVVLIAQ VVTAMITVPA VAWLARRLDK
HFVLGITGLL AALVSVAFIF LPDHNLLAVS LGMMGWGLSF GAFNLLPRAM MADAGDELRL
ESGSDQTGVL YALLISSWKL GGALAVGLSF AALALVGYKP ALMGANTPQA ISGLEMVFAG
PSAALFLLGA WLAFTYPLTR EKHAAIRLEL DARDARGEVP A