Gene Caul_0921 details

Gene Information

Locus tag: Caul_0921
Symbol:
ID: 5898376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 967893
End bp: 969260
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 73%
IMG OID: 641561404
Product: major facilitator transporter
Protein accession: YP_001682550
Protein GI: 167644887
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
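
The length fields above are consistent with the coordinates: the inclusive span from Start bp to End bp gives the gene length, and dividing by three and dropping the terminal stop codon gives the protein length. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and an unspliced CDS (the function names are illustrative and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(967893, 969260)
print(length, protein_length(length))  # 1368 455
```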


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.224208
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.285654
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGAG TCGCTCTCCC GCCCCGTCTG CCGCTGATCC TGGCGGCGGC CAGTTTCGGC 
TTCACGATCG TCCAGCTAGA CGTGACCATC GTCAATGTGG CGCTGGACGC CATCGGCCGC
GAGTTCGGCG CCCCGACCGC CAGCCTGCAA TGGGTGGTCG ACGCCTACAC CCTGCTGCTG
GCGGCGCTGT TGCTAAGCGC CGGGGCGCTG GGCGACCGGT TTGGGCCGCG CCGGGCGTTT
CTGGCGGGGC TGGTGCTGTT CGCGGTGTCG TCGGCGGCCT GCGGCCTGGC CCAGACCAGC
CTGCAACTGA TCCTGTCGCG CGCCGTCCAG GGGGGCGCCG CCGCCCTGCT GGTGCCGCCC
TCGCTGGCCC TGATCACCCA CGCGGCGGCC GGCGACGACT GGGCGCGGGC CTGGGCCGTG
GGCTGGTGGA CGGCGGCCGG CGGGGTGTCG ATCGCCGCCG GGCCGGTGAT CGGCGGCTTG
CTGATCGGGG CGTTCGGCTG GCGCTGGGTG TTTCTGGTCA ACCTGCCGCT CTGCCTGCTG
GGCGCGGCGG CGACCCTGGC CTTCGTGCCC GAGGTTCCGC CGCGCGAGAA GCGGCCGCTG
GACCTGCCCG GCCAGGTCCT GGGCTTCGTG GCGCTGACCC TGCTGGTCGG GGCGGTCATC
GAGGGCGGTC ACAGGGGCTG GAGCGACCCG CTGGTGCTGG GCGCCCTGAT CGGCGGCTTG
GCGGCCGTGG CGGCCTTCCT GGCGGTCGAG ATGGGCAGCG CCCATCCAGC GGTCCCCTTG
GACGTGTTCC GGGGCCGCAT GGTCTGGTCG GCGGCGGTGG TCGGGACGGC GGTGAACTTC
ACCTATTACG GCGTGATCTT CGTGCTCGGC CTCTTCCTCC AGCGCTCGGC CGGCTACAGC
GTGGTGCAGG CCGGCCTGGC TTTCCTGCCG CTGACGGCGA CCTTCATCAT TTCCAACCTC
TTGAGCGGGC GCGTCTCCCA TCGCTTCGGC CCGGCCCGGA CCATGGCCGG CGGGGTGCTG
GTGGCGGCGC TCGGCTACGC CCTGACCAGC CGGCTCACGC CGACCACGCC GTTCTGGTTG
ATGATTCCGG GCTTCCTGCT GATCCCCGGC GGCATGGGCA CGGCGGTGCC GGCCATGACC
AGCGCGCTGC TGGCCAATGT GGACCGGCAC TTCTCTGGCA CGGCGTCGGG GGTTCTGAAC
GCTTGCCGCC AGGCGGCGGG GGCGGCCGGC GTGGCGGTGA TGGGGGCGCT GGCGGCGGGC
GGACCCGAGC GGATCGCGGC GGGGCTGCGG GCGTCGGGGC TGATCGCGGC GGTGGTGCTG
CTGGGCACGG CGGTGGTGGC TTGGCGGAGC GAGGGGGAGG CTATCTAA
 
Protein sequence
MTRVALPPRL PLILAAASFG FTIVQLDVTI VNVALDAIGR EFGAPTASLQ WVVDAYTLLL 
AALLLSAGAL GDRFGPRRAF LAGLVLFAVS SAACGLAQTS LQLILSRAVQ GGAAALLVPP
SLALITHAAA GDDWARAWAV GWWTAAGGVS IAAGPVIGGL LIGAFGWRWV FLVNLPLCLL
GAAATLAFVP EVPPREKRPL DLPGQVLGFV ALTLLVGAVI EGGHRGWSDP LVLGALIGGL
AAVAAFLAVE MGSAHPAVPL DVFRGRMVWS AAVVGTAVNF TYYGVIFVLG LFLQRSAGYS
VVQAGLAFLP LTATFIISNL LSGRVSHRFG PARTMAGGVL VAALGYALTS RLTPTTPFWL
MIPGFLLIPG GMGTAVPAMT SALLANVDRH FSGTASGVLN ACRQAAGAAG VAVMGALAAG
GPERIAAGLR ASGLIAAVVL LGTAVVAWRS EGEAI
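
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon table from the compact NCBI-style amino acid string (translation table 11 assigns the same amino acids as the standard code and differs only in permitted start codons), and the helper name `translate` is hypothetical. It ignores the whitespace that separates the 10-base blocks.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACCCGAG TCGCTCTCCC GCCCCGTCTG"))  # MTRVALPPRL
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence once the block spacing is removed from both.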