Gene Bind_2296 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_2296 
Symbol 
ID6199771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2626865 
End bp2628664 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content61% 
IMG OID641706281 
Productaminodeoxychorismate lyase 
Protein accessionYP_001833398 
Protein GI182679252 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCCA TGTCTTCGCA TATCGATGGC CTGCCGGAAC AAGGATATCA GCCCATTCTC 
GCGGTGGGAG GCTTGTTTGG GCGCGGTACC GGACCGCAGA GCCCGAACGA GGCCTTGCAG
CCGCAGGCAG CGCCGCCGCC GCCCCCTAAC AAGCCCTCGA ACCGTCGGCG CGGACGCCTT
TCGGCCTTTA GCGGCTTTCT CTCCTTTTTA TTGATCGCCG CGATCGGCAT CATGGTCGTG
CTGATCTGGA CCCAGAGGAA AATGCAGGAA CCGGGGCCCC TGACGGCGGA CCGAGTTGTC
TTCATCGCGC CTGGCACGGA AGTGCCCGAC ATCATCGCCC GCCTTGATCG CGAGGGGATT
ATCGATAGTC CGCTGGGACT CAATATTGCC TTGCTGGTCG AGGGCAAGCG CTCCAAGGTC
AAGGCTGGTG AATATCTTTT CAAACAGGGT GCGAGCCTGC GTGATGTGAT GGACACGCTC
GTCAGCGGCA AACAAGTCTT GCACGCCCTG ACCCTGCCCG AGGGACTGAC CAGCACCCAG
ATCGTCGCGC GCATCATGGA AGATGATGTC CTGCAAGGCG ATATCCGAGA TGTTCCTAAG
GAAGGTACGA TCCTTCCTGA AACCTATAAA TTTACCCGCA ATTCCTTGCG CGCCGACCTC
GTTCGCAAGA TGCAGGAAGA TCAAAAGCGT ATCGTCGATC AGGTTTGGCA GAGGCGGGCG
AGCGATCTTC CTTTGAAATC CCCTTACGAA CTGGTCATCC TGGCCTCGAT CGTCGAAAAG
GAAACTGGCA AGGCGGATGA GCGGCCCCAT GTGGCCAGCG TGTTTCTCAA CCGCCTGCAG
AAACGTATGC GGCTGCAATC GGACCCGACG ATCGTCTATG GTCTGGTTGG CGGCAAGGGG
ACGCTCGGCC GGGCCATTCT GCGTTCCGAA GTCGAGAAGC CGACCCCTTA TAATACCTAT
GTGATCGACG GCTTGCCGCC GGGCCCGATC GCCAATCCCG GCCGGGCGGC GCTTGAGGCC
GTCGCGAATC CTTCCCGCAC GCGTGACCTC TATTTCGTCG CCGATGGCAC CGGCGGACAT
GTCTTTGCCG AAACACTCGA TCAACATGTG CGCAATGTTC AGAAATGGCG GCAGATCGAG
CACGATGCCA AGGAGAAGCA ACAGGCCCCT GATGTCGACA AAATTGCACC CACGCCTGAT
CAGCATGGGG AAGCCGATTT TCCGGGTGCC GTCTATGGCG GCTTGCCGCC CGCTTTGGCG
ACACCGTCCG CCGCAGCCTT ATCGTCCCTG CGCAATGCCC TCGTGAAGGA ACAAAAGGGG
GGTGCCCTTT CGGCGAAGGC GGGAGAGGCT AAACTGGCGC GTGCGTCAGC CGCGGCCGCG
CAAAAATATG GCCTTGGCCC GGGGCTTGAC GAATTGGGTC TCTCGATTCG GGGCGTGCCC
TCGGGTGCGG AAGCTGCGGC TGCGCTTGAC GGGCCGATTT CGGCGGAGAC AGCGGAAGCC
GCTGGGGGAT CGATGGTTCT GCCGGTTTCC GCTGCACGCC GGGCGGAACA AAAAGCCCGG
GCCATGCGCC TGGGGCTCGA ACCCGGCAGG GACGAACTCC CAGCTGAAGC TCCTGCCGAT
TCTAGCGTTG CCGTATTGAC GCAGCCCCAG CAGGAAGGGC GTGCTGCCGT TCATGGACCG
CACCATGGGG TCACGGATGC CTCCGAGGGC AAGTCGTTCG ATCCCTTGCT CGACAAGACC
TATGATCTCA ACTATGCGAA GACGGTCCCT GTCATCGGAA AAAATGATCC GCGGCTTTAA
 
Protein sequence
MAPMSSHIDG LPEQGYQPIL AVGGLFGRGT GPQSPNEALQ PQAAPPPPPN KPSNRRRGRL 
SAFSGFLSFL LIAAIGIMVV LIWTQRKMQE PGPLTADRVV FIAPGTEVPD IIARLDREGI
IDSPLGLNIA LLVEGKRSKV KAGEYLFKQG ASLRDVMDTL VSGKQVLHAL TLPEGLTSTQ
IVARIMEDDV LQGDIRDVPK EGTILPETYK FTRNSLRADL VRKMQEDQKR IVDQVWQRRA
SDLPLKSPYE LVILASIVEK ETGKADERPH VASVFLNRLQ KRMRLQSDPT IVYGLVGGKG
TLGRAILRSE VEKPTPYNTY VIDGLPPGPI ANPGRAALEA VANPSRTRDL YFVADGTGGH
VFAETLDQHV RNVQKWRQIE HDAKEKQQAP DVDKIAPTPD QHGEADFPGA VYGGLPPALA
TPSAAALSSL RNALVKEQKG GALSAKAGEA KLARASAAAA QKYGLGPGLD ELGLSIRGVP
SGAEAAAALD GPISAETAEA AGGSMVLPVS AARRAEQKAR AMRLGLEPGR DELPAEAPAD
SSVAVLTQPQ QEGRAAVHGP HHGVTDASEG KSFDPLLDKT YDLNYAKTVP VIGKNDPRL