Gene Caul_5131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5131 
Symbol 
ID5897357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp49897 
End bp51105 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content68% 
IMG OID641555234 
Productmajor facilitator transporter 
Protein accessionYP_001676565 
Protein GI167621780 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.229934 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGCGT CCAAACCTCA ACCGCGCGAG GCACTGCCGC TCGCGCTTTT CGTTCTGACC 
TTGAGCATCT TCGCGATCGG CACCTCTGAG TTCGCCATCG CCGGCCTGCT GACCGAAGTG
GCGTCCGATC TGTCGGTGTC GATCTCCGCC GCCGGGCGCT TGGTGGCGGC GTACGCCTTG
GGCGTGGCGA TCGGCGGGCC GATCATGGCC GTGCTGACCG CGCGGCTGCC GCGCAAGACG
TCCCTGATGG TGCTGATGGC GATCTTCGCG GTGGGCAACG CCGCCTGCGC CCTGGCCATC
CACTACGAAC ATCTGGCGCT CGCCCGCGTC GTCACCTCCT TGGGCCATGG CGCGTTCTTC
GGCATCGGCG CGGTGCTGGC CATGAGCCTG GTTCCCGATC ACCGAAAGGC CTCGGCGGTG
GCGGTGATGT TCGCCGGCTT GACCATCGCC AACATCCTGG GCGTGCCGCT GGGCACGGCC
CTGGGTCAGT GGGCCGGCTG GCGCGCGCCG TTTTGGGCGA TCACCGCCTT GAGCATCGCG
GCTCTGGTCG CGATCCTGAC GATGGTGCCC GACCGGCGCG ACGACGCGCC GCCCAACTTC
GCCGATGAGG CCCGCGCGCT CGCTGACGGC GGTCTTTGGG TCGCCCTCCT GACAACGGTG
GCCTTCGCCA CGTCGATCTT CCTGCTGTTT TCCTACGTCG CGCCCCTGCT CACCCAGGCG
TCGGGTGTTT CGCCCGGCGG CTTGACGCTC AGCCTGCTGT CGATCGGATT GAGCCTGGCC
GTGGGCAATA TTCTGGGGGG ACGTCTGGCG GACTGGAACC TGGGACGCGC CTTGGTCGGC
ATCGCCGTCG TCATCGCCGC AGTTTCCGGT CTGCTGGCCT GGAGCAGCGC GCATCTGCCG
GCGGCGGAGA TTAATTGGTT CGCTTGGGGC GTGGTGACGT TCGCCGCCGT GCCCGCCTCC
CAGGTCAACG TCATGCAACT AGGCCACAAG GCGCCCAACC TCGTCTCGAC GCTGAACATC
TCGGCGTTCA ACATCGGCAT CGCCACCGGC TCCTGGCTTG GCGGACAACT GCTCGACCAA
GGCGCGCGCC TGACCGACTT GCCGCTCGCC GCGGCGAGCG TGGCTGTGGC CGCGGCGGCC
CTGGCCTTCG CTTCGCAAAG GATCGCCCAA GGACGGCGCG CGCACGCCAG CGCGCCCGAA
ATCATCTAG
 
Protein sequence
MNASKPQPRE ALPLALFVLT LSIFAIGTSE FAIAGLLTEV ASDLSVSISA AGRLVAAYAL 
GVAIGGPIMA VLTARLPRKT SLMVLMAIFA VGNAACALAI HYEHLALARV VTSLGHGAFF
GIGAVLAMSL VPDHRKASAV AVMFAGLTIA NILGVPLGTA LGQWAGWRAP FWAITALSIA
ALVAILTMVP DRRDDAPPNF ADEARALADG GLWVALLTTV AFATSIFLLF SYVAPLLTQA
SGVSPGGLTL SLLSIGLSLA VGNILGGRLA DWNLGRALVG IAVVIAAVSG LLAWSSAHLP
AAEINWFAWG VVTFAAVPAS QVNVMQLGHK APNLVSTLNI SAFNIGIATG SWLGGQLLDQ
GARLTDLPLA AASVAVAAAA LAFASQRIAQ GRRAHASAPE II