Gene Bind_3004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3004 
Symbol 
ID6198288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3412491 
End bp3413762 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content63% 
IMG OID641706947 
Productmajor facilitator transporter 
Protein accessionYP_001834056 
Protein GI182679910 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGACA CGACACCACC GCGAAAGGCG CAAACGAAGC CCAGCTTCCG TCCCGCATCG 
CTCGACGCGC TCAACTTCCT TCTCGCCGAC GTGCGCGGCG CGCTCGGACC TTATCTCAAC
GTCTTTCTGA TCACGCAGCA GGGCTGGAGC CAGTCATCCG TCGGTGTCGT GACGACGATC
GGCGGCTTGA TCGGCCTCAC GGCGCAAACA CCGGTCGGCG CCACAATCGA CGCCACGCCG
GCGAAACGCG CCGTCGTGGT GGTGGCGCTT AGCGCCCTCG CTATTGGCGC GGTCGTCATT
TTCGCCGTCC CGAGCTTCTG GCCGGTGCTG GCTGCCAACA CGGTGATGGC GGTCATCGGC
GATGTCTTCG GCCCGGCCGT CGCCGCGCTA ACGCTTGGAC TGTTCGCGCA GGGGCAATTG
GCGGCAAGGA TGGGTCGCAA TGGCGCCTTC GATCATGCCG GCAACGTTGT GGTCGCGCTG
GTCGCCGGCG GTATCGGCTG GTTGTTCGGA CAAAGCGCGG TGTTCTTGCT TGTGCCACTC
TTTGCCGTCC TCGCCATTGG CGCGGTGCTG TCCATTCCCG CAGCGGCGAT TGATCACGAA
CGCGCGCGTG GAGCCGGTCC AACCAGTGGG GCTGATCGTG GTCCAGACGA TTGGCGAATT
CTGTTCAAGA GCCGACCGCT GGTGGTCTTT GCGCTGAGCG CCGCGTTGTT TCATTTTGCG
AACGCACCGC TGCTGCCGCT CGTTGGGCAA AAACTCGCGC TTGCGAATAA GGAATTCGCG
ACCGCGATGA TGTCATCGTG CATCATCGCC GCGCAGTTGG TGATGCTGCC GATCGCGCTC
TTCGCCGGAC AAAAGGCCGA GCAATGGGGT CGCAAGCCTG TGCTGCTCAT CGGCTTTGCA
ATTCTTCCTT TGCGGGCTCT GCTTTACACC TTCTCGAACG ACAGTGCCTG GTTGATCGGC
GTCCAGTTGC TGGATGGTGT CGGCGCCGGC ATCTGGGGCG TGCTGGCCCC GCTCGTCGTC
GCGGATGTGA TGGCCGGGAC GGGTCAATAC AATCTGGCGC TGGGAACTGT GGCAACCGCT
CAGGGTATCG GCGCTTCGCT CAGCGGTTTG GCGGCTGGCT TGGTCGTCGA TCATTTCGGC
TACAATGCGG CCTTCGCCTG CTCGGCCGGC GCTTCCCTGG TAGCCTTGGC CGTACTCGGT
CTTGCTCTGC CAGAGACGGG TCGTCCCAAG GAGACACAAG CGGCACTTGC GGCGACGCTA
ACCGATGGTT AG
 
Protein sequence
MDDTTPPRKA QTKPSFRPAS LDALNFLLAD VRGALGPYLN VFLITQQGWS QSSVGVVTTI 
GGLIGLTAQT PVGATIDATP AKRAVVVVAL SALAIGAVVI FAVPSFWPVL AANTVMAVIG
DVFGPAVAAL TLGLFAQGQL AARMGRNGAF DHAGNVVVAL VAGGIGWLFG QSAVFLLVPL
FAVLAIGAVL SIPAAAIDHE RARGAGPTSG ADRGPDDWRI LFKSRPLVVF ALSAALFHFA
NAPLLPLVGQ KLALANKEFA TAMMSSCIIA AQLVMLPIAL FAGQKAEQWG RKPVLLIGFA
ILPLRALLYT FSNDSAWLIG VQLLDGVGAG IWGVLAPLVV ADVMAGTGQY NLALGTVATA
QGIGASLSGL AAGLVVDHFG YNAAFACSAG ASLVALAVLG LALPETGRPK ETQAALAATL
TDG