Gene Bind_1334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1334 
Symbol 
ID6200862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp1543296 
End bp1545002 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content57% 
IMG OID641705328 
Productmajor facilitator transporter 
Protein accessionYP_001832463 
Protein GI182678317 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTTTT CGACCAGTGC TGTGCGCCAG CACGTTGAAA AACGGCCGAT GTCTTCAGCG 
GAACGAAAAG TAATTTTCGC CTCCTCGCTG GGAACGGTCT TTGAATGGTA CGATTTTTAT
CTTTATGGCT CTTTGGCCAG TATCATCGGC GCTCAATTCT TTAGCCAATT TCCGAAAACC
ACTGCCGATA TTTTCGCGCT TCTCGCTTTC GCGGCGGGTT TTCTCGTGCG CCCCTTCGGC
GCTCTCGTCT TCGGCCGGCT GGGTGATCTC GTCGGACGTA AATATACCTT CCTGGTGACG
ATCCTGATCA TGGGTCTGTC GACCTTCGTC GTCGGTCTTT TGCCGAATTA CGATTCGATC
GGCATTGCCG CGCCCATCAT CCTGATTTCC GCCCGTCTTC TCCAGGGTCT CGCGCTCGGC
GGTGAATATG GCGGGGCGGC CACCTATGTG GCGGAACATG CGCCGCATGG ACGCCGCGGC
TTTTACACCT CTTGGATTCA GACCACCGCG ACGCTGGGCC TGTTCCTGTC CCTGCTGGTG
ATCCTGGGAA CCCGCACCTT TTTTGGCGAA GCTCGTTTCG CTGAAATCGG CTGGCGTGTG
CCCTTCATTG TTTCGGTCCT GCTGCTCCTC GTCTCCTTGT GGATTCGTCT GCAATTGAGC
GAATCGCCGG CCTTCCTGAA AATGAAGGAA GAAGGCACCG TTTCCGAGAA GCCGCTGACG
GAAGCTTTCG CCACCTGGTC GAATGCCAAA ATCGCTTTGC TCGCCTTGTT CGGTCTCACC
ATGGGCCAAG GCGTCGTCTG GTACACGGGC CAGTTCTATT CCTTGTTCTT CCTGCAATCG
ATCTGCAAGG TCGACGGCTA TACGGCCAAT CTGCTCATCG CTTGGGCGCT TGTTTGCGGT
ACCGGATTCT TCGTCTTCTT CGGCTGGCTC TCCGATCATA TCGGTCGCAA GCCCATCATC
CTCACGGGCT GCTTGCTGGC TGCCCTGACC TATTTCCCGA TCTATCGGGC GATTACCGCC
AACGCCAATC CGGCTCTTGC TCAGGCCCTT GAAACGGTCA AAGTCAAGGT TGTGGCGGAC
CCGGCGGATT GCGGCAATCT CTTCAATCCC GTCGGTACGC GCGTCTTTAC CAGCTCCTGC
GATATCGCCC GCGACTTCCT GGCCAAGAGC GCGGTCCGTT ATGAAATGGT GCCCGGGCCG
GCTGGCAGCC CCGCGCAGAT CGTCGCCGAT GGCGTCAACG TCACCGCATT CGACTCGACG
CAAGTCTCCA ATGCGAAAAC GGCCATGGCC GATTTCTCGA AAACAGCCAC TGCGGCCCTC
CAAGAGGCTG GCTATCCCAA GCCCAATGAT CCGGGCATCA TTCGGATGAA ACATCCGTTC
GATCTTTCGG AACCGCGTGT CTTACATTTG ATCGGCCTGC TCGCCATTCT CGTCATCTAT
GTGACGATGG TTTATGGCCC GATCGCCGCG GCCCTCGTCG AATTGTTCCC GACGCGGATT
CGGTACACGT CAATGTCCCT GCCCTACCAT ATCGGCAACG GCTGGTTCGG CGGTCTGCTG
CCGGCCACGG CTTTCGCCAT GGTCGCCCAG ACAGGTGATA TTTATTACGG CCTCTGGTAC
CCGATCATCT TCGCCGGCAT CACGTTCGTC ATCGGCTCCC TGTTCATTCC CGAAACCAAG
GACCGGGACA TTTACGCCGA GGATTAA
 
Protein sequence
MVFSTSAVRQ HVEKRPMSSA ERKVIFASSL GTVFEWYDFY LYGSLASIIG AQFFSQFPKT 
TADIFALLAF AAGFLVRPFG ALVFGRLGDL VGRKYTFLVT ILIMGLSTFV VGLLPNYDSI
GIAAPIILIS ARLLQGLALG GEYGGAATYV AEHAPHGRRG FYTSWIQTTA TLGLFLSLLV
ILGTRTFFGE ARFAEIGWRV PFIVSVLLLL VSLWIRLQLS ESPAFLKMKE EGTVSEKPLT
EAFATWSNAK IALLALFGLT MGQGVVWYTG QFYSLFFLQS ICKVDGYTAN LLIAWALVCG
TGFFVFFGWL SDHIGRKPII LTGCLLAALT YFPIYRAITA NANPALAQAL ETVKVKVVAD
PADCGNLFNP VGTRVFTSSC DIARDFLAKS AVRYEMVPGP AGSPAQIVAD GVNVTAFDST
QVSNAKTAMA DFSKTATAAL QEAGYPKPND PGIIRMKHPF DLSEPRVLHL IGLLAILVIY
VTMVYGPIAA ALVELFPTRI RYTSMSLPYH IGNGWFGGLL PATAFAMVAQ TGDIYYGLWY
PIIFAGITFV IGSLFIPETK DRDIYAED