Gene Caul_1766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1766 
Symbol 
ID5899221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1865465 
End bp1866739 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content66% 
IMG OID641562256 
Productmajor facilitator transporter 
Protein accessionYP_001683393 
Protein GI167645730 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.601014 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.308968 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCTC CAACGATGCC GGCGCCTTCC GAGAAGATCG GGCGGTACAG GTGGGTGATT 
GTCGGCCTGC TGTTCCTGGC CATGGTGATC AACTATGTCG ACCGCCAGAC GATTGGCCTG
CTGAAGGCCG ATCTCTCCAA GGAATTCGGC TGGGACGAGA CCCACTACGC CGACCTCGTC
TTCTACTTCC AGCTGGCCTA CGCCGTGGCC TATCTCGGTT GGGGCAAGGT GATGGACAAG
ATCGGGGCCC GCTGGGGCTT CGGCATCGCC TTCCTGATCT GGCAGGTCGC CCACATCGGT
CACGCCCTGG CGCGCGGCTT CGGCGGCTTC GCCATCGCTC GCATGGGCCT GGGTATCGGC
GAGGCCGGCG GCTTCCCGGG CGGCATCAAG GCCGTGGCCG AGTGGTTCCC CAAGAACGAG
CGGGCCCTGG CCACCGGCAT CTTCAACGCC GGCACCAATA TCGGCGCCAT CGTCACGCCG
CTGGTGGTGC CGGGCATTGT CCTGGCCTTC GGCTGGCAGA TGGCCTTCAT CGTCACCGGC
GTGGCCGGCC TGATCTGGCT GCCGCTGTGG CTGATCGTCT ATCGCCGCCC GCGCGAGCAG
ACGCGCCTGT CGGCCGCCGA ACTGGCCCAT ATCGAGCAGG ACCCCGCCGA CCCCGTCGAG
AAGATCGGCT GGGCCAAGCT ACTGACCAAG AAGGAGACCT GGGCCTACGC CCTGGGCAAG
TTCCTGATCG ATCCGATCTG GTGGATGTTC CTGTTCTGGC TGCCCGACTT CCTGGGCAAG
CGCTATCACC TGGACCTGAA AACGTTCGGC CCGCCGCTGA TCGCCATCTA TCTGATGAGC
GACGTCGGCA GCGTCGGCGG CGGCTGGCTG TCGTCCTCGC TGATGAAGCG CGGCTGGAGC
ATCAACAAGG CCCGCAAGAC CACCATGCTG GTCTGCGCCC TGCTGGCCAC GCCGGTGATC
TTCGCCGCCA ATGTCGACAG CCTGTGGGCC GCCGTGCTGA TCATCGGCGT CGCCACCGCC
GCCCACCAGG GCTTTTCGGC CAACCTCTAC ACCCTGCCGT CGGACGTCTT CCCGCGCGGC
GCCGTGGGCT CGGTGGTCGG TATCGGCGGC ATGCTGGGCG CCGTCGGCGG CATGGTGTTC
TCCAAGTATA TCGGCAAGGT CCTGGACCAG ATCGGCACCT ACACGCCGAT CTTCCTGGTC
GCTGGCAGCG CCTATCTGGT CGCCTTGCTG GTCATCCACC TGCTGACCCC GAAGATGGAG
CCGGTGAAGG TCTAG
 
Protein sequence
MDAPTMPAPS EKIGRYRWVI VGLLFLAMVI NYVDRQTIGL LKADLSKEFG WDETHYADLV 
FYFQLAYAVA YLGWGKVMDK IGARWGFGIA FLIWQVAHIG HALARGFGGF AIARMGLGIG
EAGGFPGGIK AVAEWFPKNE RALATGIFNA GTNIGAIVTP LVVPGIVLAF GWQMAFIVTG
VAGLIWLPLW LIVYRRPREQ TRLSAAELAH IEQDPADPVE KIGWAKLLTK KETWAYALGK
FLIDPIWWMF LFWLPDFLGK RYHLDLKTFG PPLIAIYLMS DVGSVGGGWL SSSLMKRGWS
INKARKTTML VCALLATPVI FAANVDSLWA AVLIIGVATA AHQGFSANLY TLPSDVFPRG
AVGSVVGIGG MLGAVGGMVF SKYIGKVLDQ IGTYTPIFLV AGSAYLVALL VIHLLTPKME
PVKV