Gene Caul_5276 details


Gene Information

Locus tag: Caul_5276
Symbol:
ID: 5897276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010335
Strand:
Start bp: 215700
End bp: 217217
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 69%
IMG OID: 641555379
Product: major facilitator transporter
Protein accession: YP_001676710
Protein GI: 167621925
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID:
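
The length fields above can be cross-checked against the coordinates. The sketch below assumes 1-based, inclusive coordinates and one terminal stop codon, which is consistent with the listed Gene Length and Protein Length; the variable names are illustrative only.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 215700
end_bp = 217217

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1518
print(protein_length)  # 505
```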


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000299294
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCCGCG ACGAGCCCGT GCACGGCGGA GGAGAGCTCG CGTCCGCCGG GGGCGAGACG 
ATTTCGCCGC GGCCCAAGCT GCGGCTTGGT CTCAAACTTG GCTATGCTGC CGGAGCCTTG
CTGGACGGCG TGGCGACCCA GGCGGTCAAC ATCTTCCTCT TCTTCTACGC CACCACCGTG
TGCGGCGCGC CCGCCGCCCT GGCGGGCGTG GCCATCGCCG CCGGCCTTGT CGTTGACGCC
CTGATCGATC CGATGATCGG GTCGGCGTCG GACGCCTGCC GGTCGCGCTT TGGGCGGCGG
CTGCCGTTCA TGATGTGGGG CGCGCCGGCC ACGGCCTTGT TCCTGGTTTT GATCTTTTCG
CTGCCCGAGG CCCTGAGCGG CGTGGGGCTG GTGGCCTGGA TCACGGTGCT GTCGATCTGT
CTGCGCGTGT CGATCTCGCT CTATCTGCTG CCGTTCAACG CCGTGGGAGC GGAACTGAGC
GAGGACTACG CCGAGCGTTC CTCGATCGCC GCGTGGCGCT GGGGCGCCGC CATGGCGGGC
GCCCTGACGG CTGTCCTGCT TGGCTTTGGC GTGTTCTTTT CGGGCCCCGA GGGCTTGGCG
AAGCGCGCGG CCTACACCCC GTTCGGCGTC AGCATAGCCT TGGTTGCGCT GGTCGGCGCG
GGCCTGGCCA TGCGCGCGCT GATGTTGACG CGCGATCGCC AGAACCCGCC CCCACGTGAG
ACGGGCCCGG CGCACACACG GTTTGTGCGC GCGGTCGGCG AGATTTTCGC CAATCCGTCG
TTTCGCGTTC TGTTCGCCGG CGCGATCCTG CTGTTTAGCG CCCTGAGCGT TCACAGCACC
CTGGGCCTAC ACGCCAACAC CTATTTTTGG CGCCTGGAGC CCAAGCAGAC CCAAGCGGTG
ACCTTGGCCC TTTTTGCCGG CCTCCTGCTG GGGGCGCCGT TGGCCGGGCC TCTGCTGCGA
CGGCTGGAAA AGCGCGTGGT GCTGCTGATC GGCATCGGGG GAATGGGCCT GGCCTTGGCT
GGGCCAGCGG TCCTTCGCCT GGCGGGACTT TTACCCTTGG AAGGCGGCCA GCTGGTTCTG
CTGTTGGCCG GCGCGCTGTT CTTCGGCGGC GTGCTGATGG CGGCCGCGGC GATCGCGTTC
GTGTCCATGA TGGCCGACGC GGCCGACGAG CACGAGTACC TCTTCGGCGC GCGGCGAGAG
GGTCTCTACT TCGCCGGATG GGCGTTCGCC AGCAAGGCCG CCGCGGGTCT TGGAGCCTTG
ATCGCCGGTC TGGGGTTGGA ACTGGTCGGG TTCAAGAGCC ACGGCGGATC GGTGGCCCAG
GCCTTGTCGC CTCGGACGAT CGAGTGGATC GGCGCGCTCT ACGGGCCCGG CGCGGGCGCG
TTGGCGTTGG CGGCCGCCGC GACGTGCCTG TTCTACCGGC TCGACGCCGC CCGCCATGCC
CGGATGCTCG CTGTCCTTCG GACGCGTCGA GCCGGTCAAA CCGAGCCGGA CTCCACAGTC
CAGGAGACCG CGGCGTGA
 
Protein sequence
MARDEPVHGG GELASAGGET ISPRPKLRLG LKLGYAAGAL LDGVATQAVN IFLFFYATTV 
CGAPAALAGV AIAAGLVVDA LIDPMIGSAS DACRSRFGRR LPFMMWGAPA TALFLVLIFS
LPEALSGVGL VAWITVLSIC LRVSISLYLL PFNAVGAELS EDYAERSSIA AWRWGAAMAG
ALTAVLLGFG VFFSGPEGLA KRAAYTPFGV SIALVALVGA GLAMRALMLT RDRQNPPPRE
TGPAHTRFVR AVGEIFANPS FRVLFAGAIL LFSALSVHST LGLHANTYFW RLEPKQTQAV
TLALFAGLLL GAPLAGPLLR RLEKRVVLLI GIGGMGLALA GPAVLRLAGL LPLEGGQLVL
LLAGALFFGG VLMAAAAIAF VSMMADAADE HEYLFGARRE GLYFAGWAFA SKAAAGLGAL
IAGLGLELVG FKSHGGSVAQ ALSPRTIEWI GALYGPGAGA LALAAAATCL FYRLDAARHA
RMLAVLRTRR AGQTEPDSTV QETAA
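
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a GC-fraction helper and a basic reading-frame check. The function names are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short; paste the full sequence to check the whole gene.

```python
# Stop codons shared by the standard code and translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}

def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()

def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)

def looks_like_cds(dna: str) -> bool:
    """Length divisible by 3, starts with ATG, ends with a stop codon."""
    return len(dna) % 3 == 0 and dna[:3] == "ATG" and dna[-3:] in STOP_CODONS

# First and last lines of the gene sequence, copied from this record.
first_line = "ATGGCCCGCG ACGAGCCCGT GCACGGCGGA GGAGAGCTCG CGTCCGCCGG GGGCGAGACG"
last_line = "CAGGAGACCG CGGCGTGA"

fragment = join_blocks(first_line + "\n" + last_line)
print(len(fragment))             # 78
print(looks_like_cds(fragment))  # True
print(gc_fraction(fragment))
```

Note that `looks_like_cds` only accepts ATG as a start codon; translation table 11 also permits alternative starts, so a stricter or looser check may be appropriate for other genes.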