Gene Bind_3462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3462 
Symbol 
ID6201062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3935400 
End bp3937058 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content58% 
IMG OID641707415 
Productextracellular solute-binding protein 
Protein accessionYP_001834508 
Protein GI182680362 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0173201 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTAT CAATACAGCA AGCACAAAGC TTCGGGACGC ATGGCCGCAC ATTAGCCAGA 
CGGCTTCTCA TCAAGCTGCT GCTTCTTGCC GGGTTGATCG TCCCCTTTCC TCCGCCGATT
CAGGCCGAGC CCGCCACGAC CGCGCAAATT TGGCAGCGCG GCGAACTCGG CGATCCGGGC
TCACTGGATC CCCACAAGGC GACCACGGTC ATCGAGGGCC ATGTCCTCGC CGAGCTTTAC
GAAGGGCTGG TCATTCTCGA TGCGGAAGGG CGCCTGCAAC CGGGTGTTGC TTCGCACTGG
TGCGTGAGTG AGGATCGGCG CGTCTATCGC TTTCATTTGC GCCCCGATGC GCAATGGTCG
AACGGCGACA AAGTGACGGC ACAAGATTTC GTCTATGCCT TTCGCCGCCT TATGGATCCC
AAGACCGGGG CGCCTTATGC CAATATTCTC TATACGCTGA AAAATGCGGA AAAGGTCAAT
AAAGGGCAAT TGCCGGTGGA AGCGCTCGGT GTGCAGGCGC CCGCTGACGA TCGTCTCGAA
ATCACGCTCG ATGAGCCGGT TCCCTATCTG CTCGCGCAAC TCACGCATGT GACGGCGAAA
CCTCTCCACC GCCGTTCGAT TGAAACCTAT GGCAGCGATT TCGTGCATCC TGGCCATCTT
GTCACCAATG GCCCCTTCAT GCTGGCGGAA TTCTCGCCCA ACGATCGTCT CGTCCTTGTC
AAGAACCCGC ATCATTACGA TGCGGCGCGC ATCGGGCTCG ACAAAGAGAT TTTCTATCCG
CTGGAAGATC GTTCGGCCGC TTTGCGCCGG TTCCTGGCCG GCGAAATCCA GTCCTATAGC
GATGTGCCCG TCGATCAGAT CCGCTTCGTG CGCCGAACAC TGGGCGACCA ATTCAAGCTC
GCCCCTAATC TCGGCACTTA CTATTACGCA CTCGACACGC GGCGCCCGCC CTTCGATGAT
ATTCGCGTGC GCAAGGCTCT CTCGATGGTG ATCGACCGCG ATTTTCTCGC CGAACGAATC
TGGGGCGGCA CGATGGAGCC GGGCTATAGT TTCGTCCCGC CCGGCATTGA ATCCTACGGC
ACGCCGGCCG AACTCGCCTT CAAGGATAAA ACGCCCATCG AGCGGGAAGA TGAGGCCAAA
AAGCTCCTGG CCGAGGCGGG TTTTGGGCCG AGCGGCAAGA CGCTCACCGT CGAGATTCGC
TATAATATTT CGGAAAACCA CCGCGCCACG GCCGTCGCCG TCGCCGATAT GTGGAAACAG
ATTGGCGTCG AGACCACGCT TGTCGCCAGT GACGCAACCA GCCATTATGC CTTCATGCAT
GAGCGCCGGC CCTTCAATGT TCTCCGTTAC GGCTGGTTCG CCGATTTTCC AGATGCGGAA
AATTTCCTGT TTCTCGCCGA AAGCGGCAAC AAGGGTCTCA ATATTTCGAG CTTCAGCAAC
GAGACCTATG ATTCATTGAT GCGCGATGCC GCTCAAGAGG ATGATGCAAC GCGGCGCACC
GCGCTTTTGC ACCAGGCGGA GGCCTTACTG CTCGCCGAGG GTCCCTATGT GCCGCTGCTC
ATCTTCAAAT CAAAAAATCT GATCTCTCCG AAACTGCGCG GCTGGCACAC CAATGCGCTC
GATGTGCATC GTGGCCGTTA TATATCGATC GCGCCATGA
 
Protein sequence
MNVSIQQAQS FGTHGRTLAR RLLIKLLLLA GLIVPFPPPI QAEPATTAQI WQRGELGDPG 
SLDPHKATTV IEGHVLAELY EGLVILDAEG RLQPGVASHW CVSEDRRVYR FHLRPDAQWS
NGDKVTAQDF VYAFRRLMDP KTGAPYANIL YTLKNAEKVN KGQLPVEALG VQAPADDRLE
ITLDEPVPYL LAQLTHVTAK PLHRRSIETY GSDFVHPGHL VTNGPFMLAE FSPNDRLVLV
KNPHHYDAAR IGLDKEIFYP LEDRSAALRR FLAGEIQSYS DVPVDQIRFV RRTLGDQFKL
APNLGTYYYA LDTRRPPFDD IRVRKALSMV IDRDFLAERI WGGTMEPGYS FVPPGIESYG
TPAELAFKDK TPIEREDEAK KLLAEAGFGP SGKTLTVEIR YNISENHRAT AVAVADMWKQ
IGVETTLVAS DATSHYAFMH ERRPFNVLRY GWFADFPDAE NFLFLAESGN KGLNISSFSN
ETYDSLMRDA AQEDDATRRT ALLHQAEALL LAEGPYVPLL IFKSKNLISP KLRGWHTNAL
DVHRGRYISI AP