Gene Bind_1863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1863 
Symbol 
ID6201025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2122727 
End bp2123755 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content56% 
IMG OID641705850 
ProductSel1 domain-containing protein 
Protein accessionYP_001832976 
Protein GI182678830 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCCC CCCATATGGG ACGGCATATC ACGAGAGACG TGGCGGCTTT TCGGCTGGAG 
CTTTACGATC CGATTTCCAT ATTGATCGGT TGCGATCCGT TCACGGATAG GGCTCCCGAG
GGCGACAAAG ATATGGCGAA TCGCTTATTT GGTCAAAACC CAGGTACCGC TCCTCCGTGC
CGGCTACGGT TCCCTGCGCG AATGTCGACG CATGCGCTGA GCCTCGGCGT GGCTGGCCTG
GCGTTTCTCT TGTTTCATGC TTCAGCCATG GCCTTTGATG GGGCGGGCGA AAGCGGTGCT
GGCAGCAAGA TACCGCTGCA GATTTTCAAG AATCCGCAAG CTGCTTTGCG CGCGGGCCTT
GAAGGGTCAC GGTCGGGCAG CGATCATTGG ATCGAAGCAC TGAAATATGC TGCGGCGGGT
GGCGAATCGC TTGCGCAGTG GAAGCTCGGA AAAATGTATG CGAGCGGCGA CGGTGTTCCT
CACGACGACG TCAAAGCTTA TGAATATTTC TCGCAGATTG TCGCCTCCTA TGATGACGAT
AATCCCAACC GCCGCTTCAT GCCGCTCGTT TCCAATGCCT TCGTCGCGCT CGGTATCTAT
TATCTGAACG GTATCGCTAA TACCAAGATC AGTGCCGACC CCGCCCATGC GATGGCCATG
TTCCATTATG CGGCGATCAA TTTCGGCGAT CCCAACGCCC AATATAATCT GGCGCGCATG
TATCTCGATG GCGCGGGCAC AGCCAAGGAC AGCCGCCAAG CGGTCCGCTG GTTGTCTCTG
GCCGCCGATA AAAATCATTA TCAGGCGCAG GCTCTTCTCG GGCAGGTCCT CTTTACCGAG
AAGGAAGGCA TTCTACATCA GCGCGCCCGT GGCCTGATGT GGCTGACCCT TGCGCGGGAG
GCGCCCCTGG ATCAGCGCAA GGATAAGTGG ATTATCGATC TTTATGATCA AGCCATGGCG
TCGGCCAGTT CGGCGGAACG GCAGATTGCG CTTAGCTATC TGGAAAATCA CTTGAAGAGA
CAAGATTAG
 
Protein sequence
MMAPHMGRHI TRDVAAFRLE LYDPISILIG CDPFTDRAPE GDKDMANRLF GQNPGTAPPC 
RLRFPARMST HALSLGVAGL AFLLFHASAM AFDGAGESGA GSKIPLQIFK NPQAALRAGL
EGSRSGSDHW IEALKYAAAG GESLAQWKLG KMYASGDGVP HDDVKAYEYF SQIVASYDDD
NPNRRFMPLV SNAFVALGIY YLNGIANTKI SADPAHAMAM FHYAAINFGD PNAQYNLARM
YLDGAGTAKD SRQAVRWLSL AADKNHYQAQ ALLGQVLFTE KEGILHQRAR GLMWLTLARE
APLDQRKDKW IIDLYDQAMA SASSAERQIA LSYLENHLKR QD