Gene Caul_1092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1092 
Symbol 
ID5898547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1158495 
End bp1160171 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content66% 
IMG OID641561574 
Productmajor facilitator transporter 
Protein accessionYP_001682720 
Protein GI167645057 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAGCG ACGCCGCGAA ACCGGTGAAA GGCGCCAAGG ATGGGCGCGC CGTGGGCAGG 
AAGGACGCCC TGGTCATCGG AGCCGCTTCG GTCGGCACCG TGTTCGAGTG GTACGACTTC
TACCTCTACG GATCGCTGGC CACCTACATC ACCAAGCACT TCTTCTCAGG CGTCAACGAG
ACCACGGGCT ACATCTTCGC CCTGCTGGCC TTCGCCGCCG GCTTCGCGGT GCGGCCGTTC
GGGGCCTTGG TGTTTGGCCG GCTGGGCGAT CTGTGGGGTC GCAAGAACAC CTTCCTGGTC
ACCATGCTGC TGATGGGCCT GTCGACCTTC GTGGTCGGCC TGCTGCCCAG CTACGCCCAG
ATCGGCATCG CCGCGCCCAT CGCCCTGGTG GTGATGCGCC TGGTGCAGGG CCTGGCCCTG
GGCGGCGAAT ACGGCGGGGC GGCCACCTAT GTGGCCGAGC ACGCTCCGGC CGGACGGCGG
GGGTTCTACA CCAGCTTCAT CCAGGTGACG GCCACCTTCG GCCTGTTCCT CAGCCTGGTG
GTGATCCTGC TGACCCGCCA GGCCGTCGGC GAGAGCAGCT TCGAGACCTT CGGCTGGCGC
ATCCCGTTCC TGATCTCGGT GCTGCTCCTG GGCGTGTCGC TGTGGATCCG CCTGCAACTG
GCCGAGAGCC CCTCGTTCCA GCGCATGGTC GACGAGGGCA AGGGCAGCAA GAAGCCGCTG
GCCGACTCGT TCGCCAAGTG GGGCAATCTG AAGATCGTCA TCCTGGCCCT GGTCGGTCTG
ACCGCCGGTC AGGCGGTGGT CTGGTACACC GGCCAGTTCT ACGCCCTGTT CTTCCTGGAG
AAGATGCTCA AGGTCGATGG CGGCACCACC AACCTGCTGG TCGCCGCGGC CCTGCTGATC
GGCACGCCGT TCTTCGTGGT CTTCGGCTGG CTGTCGGACA AGATCGGACG CAAGCCGATC
ATCATGCTGG GCTGCATCCT GGCGGCCCTG ACCTATTTCC CGCTGTTCAA GACCCTGACC
ACGGCGGCCA ATCCGCAGCT GGCGGCGGCC GTGGCCAGCG CCCCGGTGAC CGTGGTGGCC
GATCCGGCCG ACTGCTCGTT CCAGTTCGAT CCGGTCGGCA AGACGGTGTT CAACCGTTCG
TGCGACCTGG CCAAGTCCTA CCTGGCCAAG GCCGGCGTCA CCTACGCCAA CCAGGCCGCT
CCGGCCGGCG CGGTCGCCCA GGTCAGGATC GGCGCGGCGA CGATCGCCTC GTTCCCCGGC
CAGACCCTCG ACAAGGCCGC CTTCAAGGCT CGCAAGACGG CTTGGGAAAA GGAACTGGGC
GCGGCGCTGA AATCGGCCGG CTATCCCGCC AAGGCCGATC CCGCCCGCAT CGACAAGCCG
CTGGTGATCG GCGTCCTGGC GATCCTGGTG CTCTACGTGA CCATGGTCTA CGGCCCGATC
GCGGCCATGC TGGTCGAGCT GTTCCCGACC AATATCCGCT ACACCTCGAT GAGCCTGCCC
TATCACATCG GCAACGGCTG GTTTGGGGGC TTCCTGCCGA CCACGGCCTT CGCCATGGTC
GCGGCCACGG GCAATATCTA TTACGGCCTC TGGTACCCTA TCGTGGTGGC GGCGGTGACC
GCCGTGGTCG GCATCCTGTT CCTGAAGGAA ACCAAGGACG TCGACATCGA GGCGTAG
 
Protein sequence
MGSDAAKPVK GAKDGRAVGR KDALVIGAAS VGTVFEWYDF YLYGSLATYI TKHFFSGVNE 
TTGYIFALLA FAAGFAVRPF GALVFGRLGD LWGRKNTFLV TMLLMGLSTF VVGLLPSYAQ
IGIAAPIALV VMRLVQGLAL GGEYGGAATY VAEHAPAGRR GFYTSFIQVT ATFGLFLSLV
VILLTRQAVG ESSFETFGWR IPFLISVLLL GVSLWIRLQL AESPSFQRMV DEGKGSKKPL
ADSFAKWGNL KIVILALVGL TAGQAVVWYT GQFYALFFLE KMLKVDGGTT NLLVAAALLI
GTPFFVVFGW LSDKIGRKPI IMLGCILAAL TYFPLFKTLT TAANPQLAAA VASAPVTVVA
DPADCSFQFD PVGKTVFNRS CDLAKSYLAK AGVTYANQAA PAGAVAQVRI GAATIASFPG
QTLDKAAFKA RKTAWEKELG AALKSAGYPA KADPARIDKP LVIGVLAILV LYVTMVYGPI
AAMLVELFPT NIRYTSMSLP YHIGNGWFGG FLPTTAFAMV AATGNIYYGL WYPIVVAAVT
AVVGILFLKE TKDVDIEA