Gene Bind_2007 details

Gene Information

Locus tag: Bind_2007
Symbol:
ID: 6199295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 2292555
End bp: 2293826
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 58%
IMG OID: 641705995
Product: major facilitator transporter
Protein accession: YP_001833119
Protein GI: 182678973
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
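
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the values listed) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
START_BP, END_BP = 2292555, 2293826

length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1272 423
```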


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.28785
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGTCAATC CTCAAAGCCC GAATGATCAA GAGCCAATGG GCCGTTCAGC GCCGCAAGGA 
CCGCAATTCG CGGTTCTTGG CGCGATCAGC CTTTGCCATA TGCTGAATGA CACGATTCAA
TCCTTGATTG TTGCCATTTA TCCCCTCTTG AAAAGCTCCC TGGCCTTGAG TTTCGCGCAG
ATCGGGCTGA TCACGTTCGT CTATCAATTG AGCGCGTCTC TGTTTCAGCC GGTCATCGGA
TATTACACGG ATCTCAAGCC GAAACCGTTC TCGCTCGTCA TTGGCATGAG TTGCAGTCTT
ATGGGCCTGC TGCTCCTTTC GGTAGCGCCG GCCTATGGCA TCGTCCTGGC GGCGGTCGGG
CTGATTGGAC TCGGCTCCTC CGTGTTTCAC CCGGAATCCT CGCGCATCGC TCGATTGGCG
TCGGGCGGTC AGCCGGGCCT TGCTCAATCG GTTTTTCAGG TCGGCGGCAA TTTCGGCACC
GCGATCGGTC CTTTGCTCGC GGCCTTTATC GTTTTGCCGC AGGGCAGGGG GAGCCTCGCC
TGGTTTTCCG TGCTGGCCTT GATTGCCATG GCGTTTCTGT TTCGCATCGG CCTCTGGTAT
GGCCGTGAAC TCGTGCGCAG AAAAGCCAAG GCGCGACAGG CCAATGTCCG AGCGCTGCCC
TTGCCGAAAA AGCAGGTGGC GGTCTCAGTC GCGATTCTGC TCGTCCTGAT CTTTTCTAAA
TATTTCTATC TGGCGAGTAT CTCCAGTTAT TACCTTTTCT ATCTCATCCA CAAATTCGGG
ATCAGTCCTG AAGAGGCGCA ACTCCGCCTG TTCATCTTTC TTGGCTCGGT CGCAGCCGGC
ACCTTGATCG GCGGGCCGAT AGGAGACCGG ATCGGCCGCA AATATGTCAT CTGGGGCTCG
ATCCTCGGCG TTCTGCCTTT CTCGCTGGCG CTCCCCCATG CGGACCTGTT CTGGACGGCC
GTGCTCACGG TGCCGATCGG CATCATTCTG TCCTCGGCCT TTGCGGCGAT CCTTGTCTAT
GCCCAGGACT TGATCCCGGG GCGAACCGGC ACGGTGGCCG GCTTGTTCTT CGGCTTTGCT
TTTGGAATGG GCGGCATTGG CGCGGCGGTG CTGGGTGGTC TTGCTGATAG CCATGGCATT
GAGTTCGTCT ATCAGCTTTG CGCCTGGCTT CCCGCCATCG GCCTGCTCGC CATGTTCTTG
CCGGATCTGC GCGAGAAGGA GCCGAAGGTG AAGGTCGTGA CCAATGTAAC CGAACAGGCC
TCGCAAGCCT GA
 
Protein sequence
MVNPQSPNDQ EPMGRSAPQG PQFAVLGAIS LCHMLNDTIQ SLIVAIYPLL KSSLALSFAQ 
IGLITFVYQL SASLFQPVIG YYTDLKPKPF SLVIGMSCSL MGLLLLSVAP AYGIVLAAVG
LIGLGSSVFH PESSRIARLA SGGQPGLAQS VFQVGGNFGT AIGPLLAAFI VLPQGRGSLA
WFSVLALIAM AFLFRIGLWY GRELVRRKAK ARQANVRALP LPKKQVAVSV AILLVLIFSK
YFYLASISSY YLFYLIHKFG ISPEEAQLRL FIFLGSVAAG TLIGGPIGDR IGRKYVIWGS
ILGVLPFSLA LPHADLFWTA VLTVPIGIIL SSAFAAILVY AQDLIPGRTG TVAGLFFGFA
FGMGGIGAAV LGGLADSHGI EFVYQLCAWL PAIGLLAMFL PDLREKEPKV KVVTNVTEQA
SQA
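
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch in plain Python, not the tool that produced this record. It assumes the sequence blocks are pasted in as strings with the spaces and line breaks shown above. The codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon -> amino acid map in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate complete codons from the first base, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last display lines of the gene sequence above.
first_line = "ATGGTCAATC CTCAAAGCCC GAATGATCAA GAGCCAATGG GCCGTTCAGC GCCGCAAGGA"
last_line = "TCGCAAGCCT GA"

print(translate(first_line))  # MVNPQSPNDQEPMGRSAPQG
print(translate(last_line))   # SQA
```

Each full display line holds 60 bases, which is a whole number of codons, so individual lines can be translated on their own as done here. Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the 58% reported in the Gene Information section.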