Gene Bind_0797 details

Gene Information

Locus tag: Bind_0797
Symbol:
ID: 6199090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 890259
End bp: 891239
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 59%
IMG OID: 641704794
Product: PDZ/DHR/GLGF domain-containing protein
Protein accession: YP_001831936
Protein GI: 182677790
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
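
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 890259
end_bp = 891239
gene_length_bp = 981
protein_length_aa = 326


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```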


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCTCA AGGACGAAGA CTGGAAAATT CCACCCGAGG CCCAGCCCAA ACAAGGCAAT 
TTTCGTTTCG ATCTCGAACA GACGATGAGT TCCATCGTCT CGATACGCTC ACGTGTGCCG
GCCGATGCCT TTACCGCCGG TATTTTGGGG ACGGAACGGT CCGGCAATGG CGTGCTGATC
GACGCCGATG GTATTGTCCT GACGATCGGC TATCTCGTGA CCGAAGCCGA GGAAGTCTGG
CTCACCACCA ATGACGGTGT GGTCGTTGCC GGCCATGTGC TCGGCATTGA TCCCGCGACG
GGTTTCGCCC TTGTGCAAGC GCTCGGCCGG CTGGAATTGC CCGTTATGCC GCTTGGGGAA
TCGCACAGCA CAAGGGTCGG CGAGAGGGTG ATCATTGGTG GCGCTGGCGG TGTCGCGCAT
GCGCTCATCG CCCATATTGT CGCCAAACAG GAATTTGCCG GCTATTGGGA ATATGTCCTC
GACGAGGCTT TGTTCACTGC TCCGGCGCAT CCCGATTGGG GCGGCGCGGC CATGATCAGC
GTGACCGGCA AGCTGCTCGG CATCGGCTCC CTGCAAGTGC CGCATCAGGT TCATGGCGAA
CAGGTGCTGC AGCTCAATAT GATGGTGCCG ATCGATTTAT TGGGACCGAT CTACGCGGAT
CTGCGGATGT ATGGTCGTCC CAATCGGCCA CCGCGTCCTT GGCTCGGCCT GTTTGCCGCC
GAGGATCATG ACAGGATCGT GGTCATTGGC TTTGCCGGTA ATGGTCCAGC CAAGCGCGCG
GGACTGAACG AGGGGGATAC GATCCTGGCT GTCGCTGGCC ATCCGGTCTC GACGCTCGTC
GATTTGTTTC GGCATATCTG GGCCCTGGGG GCCGCCGGAT GCGATGTTCC GCTCACGCTT
GAACGGGAAG GGGATGTGTT TGAGGTTCAT CTCACATCCG CCGATCGGGA GCGCTATCTG
AAATCGGCTT CAATGCATTA G
 
Protein sequence
MVLKDEDWKI PPEAQPKQGN FRFDLEQTMS SIVSIRSRVP ADAFTAGILG TERSGNGVLI 
DADGIVLTIG YLVTEAEEVW LTTNDGVVVA GHVLGIDPAT GFALVQALGR LELPVMPLGE
SHSTRVGERV IIGGAGGVAH ALIAHIVAKQ EFAGYWEYVL DEALFTAPAH PDWGGAAMIS
VTGKLLGIGS LQVPHQVHGE QVLQLNMMVP IDLLGPIYAD LRMYGRPNRP PRPWLGLFAA
EDHDRIVVIG FAGNGPAKRA GLNEGDTILA VAGHPVSTLV DLFRHIWALG AAGCDVPLTL
EREGDVFEVH LTSADRERYL KSASMH
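
The gene and protein sequences are printed in space-separated groups of ten characters. As a rough sketch of how the two blocks relate, the snippet below joins the groups and translates the first line of the gene sequence. It builds the codon table by hand rather than using a library; the amino acid assignments of translation table 11 are assumed here to match the standard genetic code, and the helper names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Alternative start codons are not handled; the gene here starts with ATG.
    """
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence and first two groups of the protein sequence.
first_line = "ATGGTGCTCA AGGACGAAGA CTGGAAAATT CCACCCGAGG CCCAGCCCAA ACAAGGCAAT"
protein_start = clean_sequence("MVLKDEDWKI PPEAQPKQGN")

print(translate(clean_sequence(first_line)) == protein_start)
```

Pasting the full gene sequence block into `clean_sequence` and translating it would allow the same comparison against the complete protein sequence.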