Gene Bind_1058 details

Gene Information

Locus tag: Bind_1058
Symbol:
ID: 6201022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 1214252
End bp: 1215286
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 59%
IMG OID: 641705051
Product: CBS domain-containing protein
Protein accession: YP_001832190
Protein GI: 182678044
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
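
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
# Illustrative consistency check; the function names are hypothetical
# and not part of any database API.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(1214252, 1215286)
print(length, "bp")
print(protein_length_aa(length), "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.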


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.279471
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.884616
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAAC GCGACACCTA TGCGGCAGGT GAAAATGCCG TGCCTGATCG GGGCGGGCAG 
GGCCAACGGC CCAATCTGAT TGATCGCTTG CGCCTTTTGT TCGGTCTCGG TGGCCCGACG
ATCCGCGATG AATTGCAGGA GGCCTTGAGC GATACGGCCA CCGACGCAGA AATTTCACCG
CAACAGCGGA CCATGCTGAA GAATGTCCTC GGTCTGCATG AGGTGCGGGT CGAAGATGTC
ATGGTGCCGC GCACTGATAT TATCGCCGTC TCGCTCGACT CGACCCTTGC CGAGGTCCTG
GATCTGTTCC GTTCCGCTGG CCATTCTCGT CTGCCCGTGC ATGGCGACAC GCTCGACGAT
CCACGCGGTA TGGTCCACAT CCGCGATTTC GTCGATTATC TTGCCGGTCT CGCCTTGCCT
GAAACCGAGA CGGTCCCGTC TCATGCCGAC AAAACGCCGC CGGCGGTGGT GAAGACGCTG
GCCGGCCCGT ATAAGCTCGA TATTGGCGCG ACGACATTGG CCGAAGCCAA GATCCTGCGG
CCCGTTTTGT TCGTGCCCCC CTCCATGCCG GTCCTTGATC TGCTCGTGAA AATGCAGGCG
ACCCGCACGC ATATGGCGTT GGTCATCGAC GAATATGGCG GGACAGACGG TTTGGCCTCG
ATCGAGGATA TTGTCGAAAT GATTGTCGGT GATATCGAAG ACGAGCATGA TTTGGAGGAA
AGTCCTAAAA TCGAAGCGAC GGAAGACGGC GCTTTCATCG TCGATGCGCG CGCCGATCTC
GAGGAAGTCG GTGCCGTGAC GGGGATCGAC TTTGAGGCGA TGGATGTCAC AGAGTCTTTC
GATACGCTCG GGGGGCTGAT CACCGCCATG ATGGGGCATG TGCCCGTCCG GGGCGAAATG
ATCGAAGAAG GGACGCTCAG TTTCGAAATT CTCGACGCCG ACCGGCAAAA GATCGAACGC
ATCAAGATTT ATGGCGCGCC GGGCGGGCGC GTTGGCGAGG AAACAGGCTA CGTCGCAGAG
AAAGGCAAAG CGTGA
 
Protein sequence
MSERDTYAAG ENAVPDRGGQ GQRPNLIDRL RLLFGLGGPT IRDELQEALS DTATDAEISP 
QQRTMLKNVL GLHEVRVEDV MVPRTDIIAV SLDSTLAEVL DLFRSAGHSR LPVHGDTLDD
PRGMVHIRDF VDYLAGLALP ETETVPSHAD KTPPAVVKTL AGPYKLDIGA TTLAEAKILR
PVLFVPPSMP VLDLLVKMQA TRTHMALVID EYGGTDGLAS IEDIVEMIVG DIEDEHDLEE
SPKIEATEDG AFIVDARADL EEVGAVTGID FEAMDVTESF DTLGGLITAM MGHVPVRGEM
IEEGTLSFEI LDADRQKIER IKIYGAPGGR VGEETGYVAE KGKA
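
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record: it uses the standard codon assignments, which translation table 11 shares for sense and stop codons (table 11 differs in the set of permitted start codons), and it ignores ambiguity codes. The helper names `translate` and `gc_content` are hypothetical. Whitespace in the 10-base blocks shown above is removed before processing.

```python
from itertools import product

# Codon table built in the conventional TCAG ordering (third base varies
# fastest). Assumption: standard amino acid assignments, as used by
# translation table 11 for elongation and termination.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def _clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = _clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = _clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, first three blocks only.
print(translate("ATGAGTGAAC GCGACACCTA TGCGGCAGGT"))
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared with the GC content field of the record.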