Gene Bind_1095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1095 
Symbol 
ID6199922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp1255217 
End bp1258228 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content60% 
IMG OID641705087 
ProductSel1 domain-containing protein 
Protein accessionYP_001832226 
Protein GI182678080 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAACAGG ATTCGGGACA GGATTTACCG CGTAGTGCCA AGGCCCAGAC AGCTCCACTT 
GGGAAGGGGG CGGCGGTCTC GCGCAGGCCT GCATTTTTGG CAGGAACCGA AACCTTGGAA
GAACGGGCTC AGAGGGCTTG GGCTTTCGCG CATAATAGAG GAATGATCCG GGCGCTCGAG
AATCGCCGTG ACCGGGAATG GGAAATGGTC GCCAGTCTTT CCTGCTCCGG AGAGGATGGA
TTTCTTCCGT TCGATGAATG GCAGGCGGTA CGCTCGGATG GCCATGGGTT GCAAAGTGAC
GAGTTGCGTC GGGAGACGAC TTTGGGAGCG GATTTCAAGA CCGGTCGCGG ATTGCCAGAG
GGTCAGGCTT CTCTCGAAGG CCTGTGGAGC GAACTTGAAT CGGAATTACT GGCCTTCGCC
CGCCGTCGTG GCCATGCTTT CGCTTCCAGC GGCCGGGTCG GCGTGGATCA TCGGCAGAAC
GCGGCAAAAG GACCGGGCGC GAGTAGCACC GCCTCAGCAG CCCAGAGCGC TTCTTTGTCG
GACTTACGGG ATGATGTCGG TCAACTCGCT GGTCAACTCG ATGCCATGCG GCAGGAAGCG
GATCGTCGCG ACAGCCTGTT GCGGAGCCAG GCGGAACGCA TTGGCGACAT CGGGCGGTTT
CGCAAGGAAA TCAGCGATCT GACGAGCCGG CTCGGTGCTT TCGCGCCGCT GGAGGCCTTG
CGTGAACTTG ACCGGGAAAT CAAGGCTATT GCCCAACGGT TTGAAAAATT AGGGGAAACC
GAGAGCGACG CCACCCCGAT ACTGCGCTTG CGGCAGCAAT TGCATGAAAT TCGCGAACTC
ATCACGGCCA TGGCGAGCCT TGCGGAAAAT ACGGTTCGGC AAAAGCCGGC CGCTGATCTG
GAAGTCTTGC GAGGGACGTT GAACGATCCG GTCCGTGCGG AATCGCTGCA GAAGATCGAG
ACCCAGATCG AGGTTTTGGG GCAAAGGGTC GATACAGCCA TCGCGCAGGC GGAAACATCC
GGGCAATATG CCGCCCTGGC GCGGCAGATC GAGGCCGTCA AACACCAATT GACGGCGCGC
ATCGACGCCA ATGCCACGCT CAGCGCGCCG CATACGCAAC AACTCGAACA ATTGGTGCGT
GTGCTGGCCG ACAAAATGGA TTCCTTGCCC GATCCGCGCA AGAGCGAACA GAAGTTCGAG
GTTCTTCAAT CCGAACTCGT CCGGATCGAC AATCGCCTCG ACCAAAGTGA CAAGATCATC
GCCTCTCTGG CGGCCATGGA AATGACCATT ACGCGGCTTT CCGCCGAGAT GGGCGTGATC
AAGGGCTCGT TGCGGCAATC TGCCGAAGCG GCCGCGCGTC AGGTCATTGA AGAGCTTCGG
CAGACATCCT CCACGGAACC TCTGATTCCG GCTGATTTGG AAAACGAGAT CGAAAAACGG
TTCGAGCGCA CCGAGGTCAT GATCCGCGAT GGCCTGGAGA GACTTGCCAG CCGGCTGACC
GATCTTGAAA CCGTTACACA TCTGGCTCGA GGCCCTGTGA CACAGGCCGG GTCCCTGGGA
TCATTGCTGG CGCCGATCTT TGCACCCGCC AGCGAGGGGC AAGGGGAAAC AGCCAGTGTT
TTTAATATTT CCCTCGACAA GGCTCCGGGA TCGGTGCGGG AAGACCTTGA CCCCTGGCGT
TCTGAGCGGG TCTCCTCGGT ACGCGGTTCT GTCTCGGCAG GCCAGAAAGT TTTGCCTCGT
GCGGAGACAG GGCCGGTTCC CCTTCAAGGT GAACGCCGGG CCAGGCAAGA GAACAAGGGC
CCCACGCTGG TCGAGGATGA ATTGCTCGAA CCGGGCAGCG GCCGTCCCCT CTTGTTGCGC
TCCAGCGGAA TGAAGAAAGG GACTTTCAAG GATCGTGAAT CCGAGGCGAT CGCGGGGCCC
AAGGCCAGAA TGAAGCAAGG GGAATCCGTG CCGAAGGGTG CGGACAAACT GGTCCTTCCG
CCTTCGCGAG AGCAATCCAC CTCTTTTGAT CGGATGATCC GTCTGCTCAA AAGCGCCCGC
CATTTGGTCC GGGGACGGCT AGTCTTGTTT TCGCTCGCGG GACTTTTCTC CCTGGCCGTC
CTGGATGGAT TGGTGCATGC CATCGGGAGC ATGGATCAGC GGAACAATGC GGGCCATGCG
GAACAATCCG GTCCGGCCAA GACCGCGAAT TTTGTGGAAA CCTCGCCGCA GTCCTCAGCG
CCAGCCGCCA CAATTCCGGC CACGACCCAG AAAACATCCG AAACCCCTGC CGCAGCCAAA
TCCAGCATAA ATAAAGTGGG GGCGCTGGCC TTTTCGACAC GGGAAGACAT TCTGCGCCTC
TTCCGCTTGG GTGAAGCCGG CCATGCCGGC GCGCAATATC ATCTTGCCTT GCTCTTGCAG
GAAGGCACGG CCCATGCGGA AGGAAATGAC CAGGCGACGG ATCTGCGCGC CGCCGCTTTT
TGGTACGGGA AAGCCGCTGA CCAGGGTCTC GCCCCGGCGC AATATCGGCT AGGCCTGCTC
TACGAAAAAG GCTTCGGCGT AGACCGCGAT CTTCACAAGG CGACGGATCT TTATCGCCAG
GCGGCGGAGC AGGGCAATAC ACGCGCCATG CATAATCTTG CTGTCCTCTC GGCCGAATCC
GAAAATGGTC CGCCTGATTA TGCGGCCTCT GTCAAATGGT TCACCAAGGC CGCCGAATAT
GGCCTGCGCG ACAGCCAGTA TAATTGTGCG ATATTGCTCG CCCGGGGGCT CGGCGCGCCG
CGCAATCTGG TTCAGGCCTA TGCCTGGTTC GCCATTGCGG CGGCTCAGGG AGACGAGGAA
GCCGGTCGCA AGCGCGATGA AGTGGCGCGG CATCTTTCAA CGGAAGATTA CACAGCGGGC
AAAACCATAG TCGCCGCATA CAGGCCCCAG CCGTCCAAAA TGGCGGCTAA TGAAGTGTCT
ATGCCTTCAG GCGATGGGGA GCAGGGATCG CGCCAGGCCG AGGTGATGAA GCCGAGCCTT
TCTGGTCTGT GA
 
Protein sequence
MEQDSGQDLP RSAKAQTAPL GKGAAVSRRP AFLAGTETLE ERAQRAWAFA HNRGMIRALE 
NRRDREWEMV ASLSCSGEDG FLPFDEWQAV RSDGHGLQSD ELRRETTLGA DFKTGRGLPE
GQASLEGLWS ELESELLAFA RRRGHAFASS GRVGVDHRQN AAKGPGASST ASAAQSASLS
DLRDDVGQLA GQLDAMRQEA DRRDSLLRSQ AERIGDIGRF RKEISDLTSR LGAFAPLEAL
RELDREIKAI AQRFEKLGET ESDATPILRL RQQLHEIREL ITAMASLAEN TVRQKPAADL
EVLRGTLNDP VRAESLQKIE TQIEVLGQRV DTAIAQAETS GQYAALARQI EAVKHQLTAR
IDANATLSAP HTQQLEQLVR VLADKMDSLP DPRKSEQKFE VLQSELVRID NRLDQSDKII
ASLAAMEMTI TRLSAEMGVI KGSLRQSAEA AARQVIEELR QTSSTEPLIP ADLENEIEKR
FERTEVMIRD GLERLASRLT DLETVTHLAR GPVTQAGSLG SLLAPIFAPA SEGQGETASV
FNISLDKAPG SVREDLDPWR SERVSSVRGS VSAGQKVLPR AETGPVPLQG ERRARQENKG
PTLVEDELLE PGSGRPLLLR SSGMKKGTFK DRESEAIAGP KARMKQGESV PKGADKLVLP
PSREQSTSFD RMIRLLKSAR HLVRGRLVLF SLAGLFSLAV LDGLVHAIGS MDQRNNAGHA
EQSGPAKTAN FVETSPQSSA PAATIPATTQ KTSETPAAAK SSINKVGALA FSTREDILRL
FRLGEAGHAG AQYHLALLLQ EGTAHAEGND QATDLRAAAF WYGKAADQGL APAQYRLGLL
YEKGFGVDRD LHKATDLYRQ AAEQGNTRAM HNLAVLSAES ENGPPDYAAS VKWFTKAAEY
GLRDSQYNCA ILLARGLGAP RNLVQAYAWF AIAAAQGDEE AGRKRDEVAR HLSTEDYTAG
KTIVAAYRPQ PSKMAANEVS MPSGDGEQGS RQAEVMKPSL SGL