Gene Bind_2194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_2194 
Symbol 
ID6199066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2508662 
End bp2510482 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content59% 
IMG OID641706185 
Productextracellular solute-binding protein 
Protein accessionYP_001833303 
Protein GI182679157 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.808425 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTGCT CCTGTTTCAA ACCTGTTTTC CTCCTCCTCT TGCTTTGGCA GACATCCTTT 
GGATTCGCAT CGCTTGGAAG GGCCGCAGAA GCCGCGCATG CGATCGCCAT GCATGGCGAG
CCGGCTCTCT CAGCGGATTT TCCCCATTTT CCCTACGCCA ATCCCAATGC ATCGCAGGGC
GGCACTCTGC GTCTTGGTTT TCAAGGGACA TTCGACAGTC TCAATCCATT CAATCTCAAA
GCCGGTTCGA CCGCCCAAGG TCTCAACGGC AATATTTTCG AACCCCTCAT GACGCGCTCG
CTCGACGAGC CTTTCACGCT CTACGGATTG ATCGCCGAGT CGATCGAGAC CGATGACTCA
CGCGAGTTCG TCATTTTTCA TCTCAACCCC AAGGCGCATT TTTCCGACGG CACACCCATC
ACGGCAGACG ATGTCCTCTT CAGTTTCAAC CTGTTGAAAA CCCATGGGCG GCCACAGCAT
CGCGTCTCCT ATGGCATGGT GAAATCAGCC ACTGCTCCCG ATCCCCTGAC TGTCCGCTAT
GATCTCAATT CCGGGCAAGA CCGGGAAATG CCCCTGATGC TCGCCTTGAT GCCCGTTCTG
CCCAAACATC TCGTCAATCC AGCGACCTTC GATGAAGCGA CGCTCAATCC GCCGACCGGA
TCGGGCCCCT ATATCCTCAC GGAAGTCAAA CCGGGCGAGC GTCTCATCCT GCATCGAGAT
CCCCATTATT GGGCGGCCGA TCTGTCGACA CGGCGCGGCC TGTTCAATTT CGAGACGATC
ACCATTGACT ATTTTCGCGA CGCCAACAGC CTGTTCGAAG CTTTCCGGGC GGGCCTCATC
GATTTCCGCG AGGAGACCAG CCCCGCACGC TGGATGAAGG CTTATGATTT TCCAGCTCTC
ACCGAGGGCC GGATCTTCAA GGAAGCTTTG CCCATCGGCG GCCCCAAGGG CATGGAGGGA
TTCGTCTTCA ATCTGCGGCG CCCCCTCTTC ACGGATATCA AGGTTCGCGA GGCGCTCGCC
TCCCTCTTCG ATTTCGAATG GATCAACACC AATCTTTATG GTGGCCTGTA CCGCCGCACA
CAGAGTTTCT TCGACGAATC GGAATTAGCC TCCACCGGCC GCCCGGCGAG CGAGGCCGAA
CGGCGCCTGC TCGCGCCGTT TCCCGGCGCG GTGCGTGAGG ATATTTTGGA AGGGCGCTGG
CATCCACCGC AGACCGACGG CTCAGGGCAG GATCGCACCC AACCTCGCCA TGCTCTCGGG
CTCCTGCACG AGGCAGGTTA TGATTTGAAG GACGGCCTCC TCTCCAAGGA GGGTAAGCCG
CTCTCCTTCG AGATCATGGT CACGGATCGT AATCGAGAGA GGCTGGCGCT TGATTATGCC
CGTTCGCTGA CCCGGATCGG CGTCGATGCC CATGTCCGCC TGGTCGATGA AGTTCAGTAT
CAGCGGCGGC GCCAAAAATT CGATTTCGAC ATGATGATCG GCAGTTGGAT AGCCTCCGCC
TCGCCCGGCA ATGAGCAGCG GTCACGCTGG GGCTCGAAAA GCGCCGATCA GGAAGCCTCG
TTCAATCTTG CCGGTGTCAA ATCACCCGCC GTGGATGCGA TGATCAATCA TCTCCTCGCC
GCACGAACGC ATGACGATTT CGTCACAGCC GTACGGGCCT ATGATCGTGT TCTGCTTTCA
GGCTTTTACG TCGTGCCGCT GTTTCATTCG CCGACACAAT GGATCGCCGG AACGACACGG
CTCGGCCGGC CCGATGTCCT GCCCCGCTAT GGCGCGCCGA GCGGCAGCGC GACCTTGGAA
ACCTGGTGGA TGCGGCCGTA A
 
Protein sequence
MPCSCFKPVF LLLLLWQTSF GFASLGRAAE AAHAIAMHGE PALSADFPHF PYANPNASQG 
GTLRLGFQGT FDSLNPFNLK AGSTAQGLNG NIFEPLMTRS LDEPFTLYGL IAESIETDDS
REFVIFHLNP KAHFSDGTPI TADDVLFSFN LLKTHGRPQH RVSYGMVKSA TAPDPLTVRY
DLNSGQDREM PLMLALMPVL PKHLVNPATF DEATLNPPTG SGPYILTEVK PGERLILHRD
PHYWAADLST RRGLFNFETI TIDYFRDANS LFEAFRAGLI DFREETSPAR WMKAYDFPAL
TEGRIFKEAL PIGGPKGMEG FVFNLRRPLF TDIKVREALA SLFDFEWINT NLYGGLYRRT
QSFFDESELA STGRPASEAE RRLLAPFPGA VREDILEGRW HPPQTDGSGQ DRTQPRHALG
LLHEAGYDLK DGLLSKEGKP LSFEIMVTDR NRERLALDYA RSLTRIGVDA HVRLVDEVQY
QRRRQKFDFD MMIGSWIASA SPGNEQRSRW GSKSADQEAS FNLAGVKSPA VDAMINHLLA
ARTHDDFVTA VRAYDRVLLS GFYVVPLFHS PTQWIAGTTR LGRPDVLPRY GAPSGSATLE
TWWMRP