Gene Bind_3358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3358 
Symbol 
ID6201627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3810316 
End bp3812226 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content58% 
IMG OID641707304 
Productextracellular solute-binding protein 
Protein accessionYP_001834404 
Protein GI182680258 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0726814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.160875 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACTGGAC GATTCCCGCT CTTCTCGACT CGCCGTCAAG CATTGGGGCT CGCCCTCGGC 
GGCCTTGGCG GTCTCGCCAT GCCGCCCGCT TTTTTCTTTG CAAAGCGCGC GAAGGCTCAG
GAACAAAAGC CAACGGAAAA GCCCGTTGAA GGCGAAGCTT ATGAGGCGCA CGGCCTCTCG
ATCTTCGGCG ATCTCGCCTT ACCGCCTGAT TTCAAGCATC TGCCTTACGT GAATCCGGAT
GCGCCCAAGG GCGGCATCTT CATCGAACAG GCCGGCTTCA ATACGTTCAA CACGCTGAAT
GTCTTCATCC TCAAGGGGGA TGGCGCGGCT GGCATGGGGC TGATCTTCGA CAGCCTGATG
ACGGGAAGCA ATGATGAACC CGATGCGCTT TATGGCCTCG TCGCTCACAA GGTAGAAGTT
TCAGCCGACC GCACCCTCTA TCGATTTTTT CTCCGCAAGG AAGCGCGTTT CCACGACGGC
TCGAAGATCA AGGCGTCGGA CGTGGTGTTT TCCTTGAATC TGATCAAAAC CAAGGGCCAT
CCAGTCTTGC GCCAGGGCCT GCGCGATCTC GAAAGCGCCG AAGCCGAGGC CGAGGATATC
GTACGCGTCC GCATGCGCCC GGGCCATAGC CGCGAATCTC CGCTGACCGT CGCCAGTCAG
CCGATTTTTT CGCAGGCCTA TTACACGACC CATAATTTTG AGGAGACCAC GCTTGAACCG
CCCCTTGGTT CCGCGGCCTA CAAGGTCGGC CCGTTCGAGC AAGGGCGCTA TATCTCTTTC
ACGCGGGTCG AGAATTATTG GGGCAAGGAC CTCCCGATCA ATCGCGGGCG CAGCAATTTC
GATGTGGTGC GCTACGAATA TTTCAACGAT CGCAAAGTGG CTTTCGAGGC GTTCAAAGCC
GGCGTCTTCA CCTATCGCGA GGAATATACC TCACTCATCT GGGCGACAGG CTATGATTTC
GCCGCGCTCA AGGAAGGCAA GGTCAAACGC GAAACTGTTC CAGACGCCTA TCCGCGCGGT
ACGCAAGGCT GGTTCCTGAA TACGCGGAGA ACCAAATTCA AGGATGCGCG CATCCGCCGG
GCGCTCGGCT ATGCCTTCGA TTTCGAATGG ACCGACGCCA ATATCATGTA CAATCTCTAC
AAGCGTACGG TTTCCTATTT CCAGAACTCG CCCATGGCCG CCGAAGGCCT GCCCTCGGCC
GAGGAACGCG CTCTGCTCAC GCCTTTCGAA GATCAATTGC CTCCCGAGGT TTTTGGCGAG
GCTGTCGTGC CCCCCGTCTC AGACGGCTCC GGCCAGGATC GCAAGCTCCT GCGCGAGGCC
AGTGAGCAAT TGCGGCAGGC CGGCTGCACC CGCAAGGGCA ATACACTCCT GCTCCCCGAC
GGCAAGCCCT TCGAAATCGA ATTTCTTGGC TTTGAAACCT CGTTCCAGCC CCATACGGCG
GCTTTCATCA AAAATCTCAA ATTGCTCGGG ATCGATGCCG ATTATCGTGT CGTCGATGCG
GCCCAATATA AACGCCGGGT CGATGAGTTC GACTATGATA TTGTCACGGA ACGATTTAGT
TTCGGCCTGA CGCCAGGCGA GGATATGCGG CTCATTTTCG GTTCCGAAAC GGCGAATCTT
TCCGGCTCCC GCAATGTGGC GGGCATTGCC TTGCCGAGCG TCGATGCCCT CATCGCAAAA
GCTCTGGTCG TCGACACGCG CGAGAACCTG ACCACGATCT GCCGGGCGAT CGACCGCATC
TTGCGCGCCC ATTATTTCTG GGTGCCGATG TGGAACAATC CTAACCATCT GCTGGCCTTC
TGGGATCTGT TCGGCCGTCC GGAGCGCATG CCCCATTATG ATGTCGGCGT GCCCTCGACC
TGGTGGTTCG ATCCTCAAAA GGCCCAGCGC ATCGGTTGGC GGAAACCCTA G
 
Protein sequence
MTGRFPLFST RRQALGLALG GLGGLAMPPA FFFAKRAKAQ EQKPTEKPVE GEAYEAHGLS 
IFGDLALPPD FKHLPYVNPD APKGGIFIEQ AGFNTFNTLN VFILKGDGAA GMGLIFDSLM
TGSNDEPDAL YGLVAHKVEV SADRTLYRFF LRKEARFHDG SKIKASDVVF SLNLIKTKGH
PVLRQGLRDL ESAEAEAEDI VRVRMRPGHS RESPLTVASQ PIFSQAYYTT HNFEETTLEP
PLGSAAYKVG PFEQGRYISF TRVENYWGKD LPINRGRSNF DVVRYEYFND RKVAFEAFKA
GVFTYREEYT SLIWATGYDF AALKEGKVKR ETVPDAYPRG TQGWFLNTRR TKFKDARIRR
ALGYAFDFEW TDANIMYNLY KRTVSYFQNS PMAAEGLPSA EERALLTPFE DQLPPEVFGE
AVVPPVSDGS GQDRKLLREA SEQLRQAGCT RKGNTLLLPD GKPFEIEFLG FETSFQPHTA
AFIKNLKLLG IDADYRVVDA AQYKRRVDEF DYDIVTERFS FGLTPGEDMR LIFGSETANL
SGSRNVAGIA LPSVDALIAK ALVVDTRENL TTICRAIDRI LRAHYFWVPM WNNPNHLLAF
WDLFGRPERM PHYDVGVPST WWFDPQKAQR IGWRKP