Gene Bind_3069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3069 
Symbol 
ID6198152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3503392 
End bp3504612 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content50% 
IMG OID641707016 
ProductSMP-30/gluconolaconase/LRE domain-containing protein 
Protein accessionYP_001834119 
Protein GI182679973 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3386] Gluconolactonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.976711 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACAATG CTGATGCAAA ATCTCGCGCG CTTTTTATTT CGCGGCGTAT TCTGATGCGT 
TCAACTTTTG GCTTGGCGGG TGCGATGGCC TTTCCAGGCC TTGGCAAGAC ACAGAATGAC
GCCAAGTTCG GAACACCTCC AAGCGTGATT ACTCAACCCC CACGGCAATG GGGACCAACC
GCACCTCCTT CTCCCTATCC CGACCCTGAT ATCCTTGTTC TCGATCCATC TTTCAACGAC
CTGCTCTTGG GAATTACAGC AATCCGCCGC GTCTGGACGG GTGGTCGTTG GTTGGAAGGA
CCAGCGTGGT CAAGCCAAGG TCATTATCTC GTCTTCAGTG ATGTACAAGC TGATATACAA
TATCGTTATA TTTGGGAGAC TAATCAGGTC ATTCCGTATC GGCAGCCTTC GCATAATAGC
AACGGTAATA CTTTTGATTT TCAAGGACGG CAAATATCCA CTCAGGATTT TTTTCGACGG
TTGGTGCGAT GGGAACATGA TGGCAGCATG ACTGTGCTAT CCTCTCAATT TGAAGGCAAA
TCTTTGAATT CTCCAAATGA TATTGTCCCT CATCCTGATG GCAGCCTGTG GTTTACGGAT
CCCGCCTATG GCATGACGCT TTCCGAAGGT CACCCAGACA TGGCCAGAGG CCCCGCTAAT
CCGCAGGGAT TTTTCAATCC GCGCCTCGGG GCTGAGAATA GCGATCTGAT CGGAGGACAA
AAGCGGGAAT TGCCGAGCAA TGTCTATCGA CTCTCACCAG ATGGCCATCT CGATGCGGTT
ATTCAGGAGA GCCAAGTGCC AGATCCCAAT GGCCTTTGCT TCTCGCCGGA TTACAAGACA
CTTTATGTCG TAAGCACTGC AAAAGCACCA AGCGATAATG GCCCCGGTGG CAAAGGCGTT
ATATATGCAT TTGATGTGCA AGGTGACCGG CCACGTAATA TGCGTTTGTT CACGGACATG
GTCGTTGATG GGGTACATTG CGGACCGGAC GGATTACGGG CAGATATTTT TGGTAATCTT
TGGTGTTCAT CAAACGGACC GCTCGGTTAT TCAGGCGTTT TAGTCTTCAA TCCATCTGGT
AAGTTGATAG GTCGCCTACG CCTTCCGGAG GTTTGTGCCA ATGTAGCCTT TGGAGGGCCA
AAGAGAAATC ATCTCTTCAT GACTGCAAGC CAATCGCTTT ATATATTGCA AGTCCAGACT
CAAGGTGCTG CCCCCGGCTA G
 
Protein sequence
MYNADAKSRA LFISRRILMR STFGLAGAMA FPGLGKTQND AKFGTPPSVI TQPPRQWGPT 
APPSPYPDPD ILVLDPSFND LLLGITAIRR VWTGGRWLEG PAWSSQGHYL VFSDVQADIQ
YRYIWETNQV IPYRQPSHNS NGNTFDFQGR QISTQDFFRR LVRWEHDGSM TVLSSQFEGK
SLNSPNDIVP HPDGSLWFTD PAYGMTLSEG HPDMARGPAN PQGFFNPRLG AENSDLIGGQ
KRELPSNVYR LSPDGHLDAV IQESQVPDPN GLCFSPDYKT LYVVSTAKAP SDNGPGGKGV
IYAFDVQGDR PRNMRLFTDM VVDGVHCGPD GLRADIFGNL WCSSNGPLGY SGVLVFNPSG
KLIGRLRLPE VCANVAFGGP KRNHLFMTAS QSLYILQVQT QGAAPG