Gene Bpro_3544 details

Gene Information

Locus tag: Bpro_3544
Symbol:
ID: 4013807
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas sp. JS666
Kingdom: Bacteria
Replicon accession: NC_007948
Strand:
Start bp: 3744755
End bp: 3746002
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 69%
IMG OID: 637943206
Product: major facilitator transporter
Protein accession: YP_550350
Protein GI: 91789398
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
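
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Coordinates taken from the record above.
length_bp = gene_length(3744755, 3746002)
print(length_bp)                  # 1248
print(protein_length(length_bp))  # 415
```

Both values agree with the Gene Length and Protein Length fields listed in the record.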


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.773676
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.279857
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAGC CCCAAACCGC TGGCCCGCGC AAGGCCGCAC TCTGGCTGGT GCTGCTGGCA 
GCCGGCGGCG CCTTTGCCCT GACGATGGGT GTGCGCCAGA CCATGGGCCT GTTCCTGTCG
GCGCTTAACA CTTCGACCGC GCTGGGCATC GGCAGCATCA GCTTGGCCTT TGCCTTCGGC
CAGCTCTGGT GGGGCCTGAC ACAGCCCTTT GCCGGCGCTG TGGCCGACCG CATCGGCACC
GGTCGCGTGA TTTTCATCGG CGTGCTGCTG GTCGCCGTGG GCACCATCAT CACGCCGCTC
ATGACCAGCA CGGCCGGCCT GATTTTTGCC ATCGGCGTAC TGGCCGCGGG CGGCGCCGGC
ATGGCCGGTC CGTCGGTCCT GATGGCCGCG AGCACGCGCC TGGTGCCGCC TGAAAAACGG
GGTCTGGCCA CCGGCATCGT CAACGCCGGC GGCTCCTTCG GCCAGTTCGC CATGGCGCCC
ATCGCCATTG GCCTGACGGC CGCCGTGGGC TGGGCCAGCG CGATGCAATG GCTTGGCGTG
CTGGTGTTGC TGGCGCTGCC GGCGGTGTGG GCGCTCAGGG GCAACGCGAA AGCCCTGGCC
GCGCAGGCCG CGGCGGCTTC GGGCACCAAG GCGCTCAGCG CCCGCCAGGC GATTGGCGAG
GCGCTTGCCA CGCCCAGCTA CCGCTACCTG AGCCTGGGCT TCCTGGTGTG CGGTTTTCAC
GTCGCCTTCC TGGCCACGCA CCTGCCGGGC GTGGTGGCGG CCTGCGGCCT GCCGCCCGAG
GTGGGCGGCT GGGCGCTCGC GATGATCGGC CTGTTCAACA TCGTCGGCAG CCTGGCGATG
GGTTGGGCTG TCGGCCGCTG GCGCATGAAG TCGCTGCTGG CCCTGCTGTA CACCACGCGT
GGTCTGGCCG TGCTGGCCTT CCTGCTGGCG CCCAAAACGC CGGCCGTCAT GCTGGTCTTT
GCGGCCGTCA TGGGCGTGAC TTTTCTCTCG ACGGTTCCGC CAACCGCAGG GCTGGTCGCC
AAGATGTTCG GCACCTCCAA CATGGCCATG CTGTTTGGCG TGGTGATGCT GGCGCACCAG
GTCGGCGGCT TTCTCGGCGC CTATCTCGGC GGTTATGTGT TTCAGGTGAC CGGCAGCTAC
GACTGGGTCT GGTACATCGA CATTGTGCTG GCCGCTGGCG CTGCGCTGGT GAACCTGCCG
ATCCGCGAAG CCGGTCTGCC GCGCCGCGCC ATGTCTGCGG CGGCCTGA
 
Protein sequence
MNKPQTAGPR KAALWLVLLA AGGAFALTMG VRQTMGLFLS ALNTSTALGI GSISLAFAFG 
QLWWGLTQPF AGAVADRIGT GRVIFIGVLL VAVGTIITPL MTSTAGLIFA IGVLAAGGAG
MAGPSVLMAA STRLVPPEKR GLATGIVNAG GSFGQFAMAP IAIGLTAAVG WASAMQWLGV
LVLLALPAVW ALRGNAKALA AQAAAASGTK ALSARQAIGE ALATPSYRYL SLGFLVCGFH
VAFLATHLPG VVAACGLPPE VGGWALAMIG LFNIVGSLAM GWAVGRWRMK SLLALLYTTR
GLAVLAFLLA PKTPAVMLVF AAVMGVTFLS TVPPTAGLVA KMFGTSNMAM LFGVVMLAHQ
VGGFLGAYLG GYVFQVTGSY DWVWYIDIVL AAGAALVNLP IREAGLPRRA MSAAA
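
The sequences above are displayed in space-separated blocks of 10 residues, 60 per row. The following is a minimal sketch for stripping that layout before further processing and for computing a GC fraction from a cleaned nucleotide sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the GC content field on this page may have been computed or rounded differently.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C characters in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above, copied with its block spacing.
first_row = "ATGAATAAGC CCCAAACCGC TGGCCCGCGC AAGGCCGCAC TCTGGCTGGT GCTGCTGGCA "
cleaned = clean_sequence(first_row)
print(len(cleaned))  # 60
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field.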