Gene Bind_0354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_0354 
Symbol 
ID6198752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp401158 
End bp402939 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content59% 
IMG OID641704346 
Productshikimate kinase., 3-dehydroquinate synthase 
Protein accessionYP_001831497 
Protein GI182677351 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase
[COG0703] Shikimate kinase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATG GCGTAAATTC GATTTCTCCC CCACAGCAAG AATACGCGCC GGACGACCGG 
CGCGCTCATT CGATCATCTC CAGTCTTGGT TCACGTTCCC TCGTTCTCGT TGGATTGATG
GGGTCTGGCA AGACCTCGAC CGGACGCCGC TTGGCGCAAA GGCTCGGCCT TCCCTTCGTC
GATGCCGATG TGGAAATCGA ATCCGCTGCT GGCATGACGA TTTCCGAAAT CTTCGCGCGC
CACGGAGAAG ATTATTTCCG GGACGGCGAA CGGCGGGTCA TGGCGCGGCT TCTGGCCGAT
GGGCCCAAAA TTCTGGCCAC CGGCGGCGGC GCTTTCATGA ACGAGGAAAC CCGTTCCCGC
ATCGCCGAGA AAGGCGTATC GATCTGGCTC AAGGCCGATC CTGACGTCTT GTGGCGGCGC
GTCAAGAAAC GCCCACATCG GCCGCTTCTG CAAACGCGCG AGCCGGAAAA AACCTTGCAC
CGCCTGATGG AGCAACGCTA TCCGATCTAT GCGCGGGCCG ATATTGCCGT CGAATCGCGC
GACGGACCGC ATGATGCCGT AGTCGAGGAC ATTCTGACGG CACTCGAGTT TTTCCTGCGC
TTCTCGCCCA ATCCGCCGCT CCCCCCACCC GGAACGCTCA ACCCCTCTTT TCTTGGACAA
GATTCCGCCT TGACCGAACT GGTTCCCGTT GAACTCGGCG CGCGCGCTTA TGAGATCCAT
ATTGGTCCCG ACCTCATTGC GCGGGCTGGA TCCCTGATTG CCGCTCTGGC CCCGAAAGCC
GCCTGCGCCG TCATCACCGA TGATAATGTC GCGCGCGAAC ATCTGCCCCG GCTGGAACAG
GCCCTCGCGC AACAGGGCAT AAGATATGAG ACCATCAAAG TGCGGCCGGG CGAAGGGTCG
AAATCCTTTC CGGTCTATGC CGAGGTCTGC GACGCCGTCA TCGCCGGTAA ATTCGAGCGG
CAGGATCTCA TTCTCGCGCT CGGCGGCGGC ATTGTCGGTG ATCTCGCCGG ATTTGTCGCG
GCAACGGTGC GGCGCGGCAT GCGCTTCGTT CAATTGCCCA CGACTCTCCT GTCACAGGTC
GATTCCTCGG TCGGCGGCAA GACTGGCATC AATTCACCAC ATGGCAAGAA TCTGGTCGGG
GCCTTTCATC AGCCCTCGCT GGTTCTCGCC GATACGACCG CGCTTGAAAC CCTGCCGAAA
CGTGAGTTCC GAGCGGGCTA TGCGGAAGTG GTCAAATATG GCCTGATCAA CGATCCCGAT
TTCTTCTTCT GGCTCGACAT GCATTGGCCA AACGTCTTCC AGGGGGGAGC GGATCGTGTG
CATGCCATTG CAACAAGTTG CAGAGCCAAG GCGGCCATCG TCAAACGGGA TGAATTGGAA
ACCGGGGAAC GGGCTCTGCT CAATCTCGGT CATACATTCG GGCATGCTTT CGAGGCCCTG
ACGCATTTCG ACAATGCGCG TCTCGTCCAT GGCGAAGGCG TCGCGATTGG CATGGCTTGC
GCCTTTCGCT TTTCAGTCAA GCGCGGTCAT TGTTCCCCCG AGGATGCCGC GCGCGTGGAC
AATCACTTGC GCATGGTGGG TCTGCCGACA CGTATTCGCG ACATCAAAGA TTTCGACGCG
GATGCCACGG CTATTCTTGC GGCCATGTAT CAGGACAAGA AAGTCGAGCG GGGGACTCTC
ACCTTCATTC TCGCCCGCGC GATCGGGCAT TGTTTCATCG AGAAAAAGAT CGAGGGAGAA
GAGGTCAAAG CCTTCTTGGA AGAGGAACTG ATGCTCTCCT AG
 
Protein sequence
MSDGVNSISP PQQEYAPDDR RAHSIISSLG SRSLVLVGLM GSGKTSTGRR LAQRLGLPFV 
DADVEIESAA GMTISEIFAR HGEDYFRDGE RRVMARLLAD GPKILATGGG AFMNEETRSR
IAEKGVSIWL KADPDVLWRR VKKRPHRPLL QTREPEKTLH RLMEQRYPIY ARADIAVESR
DGPHDAVVED ILTALEFFLR FSPNPPLPPP GTLNPSFLGQ DSALTELVPV ELGARAYEIH
IGPDLIARAG SLIAALAPKA ACAVITDDNV AREHLPRLEQ ALAQQGIRYE TIKVRPGEGS
KSFPVYAEVC DAVIAGKFER QDLILALGGG IVGDLAGFVA ATVRRGMRFV QLPTTLLSQV
DSSVGGKTGI NSPHGKNLVG AFHQPSLVLA DTTALETLPK REFRAGYAEV VKYGLINDPD
FFFWLDMHWP NVFQGGADRV HAIATSCRAK AAIVKRDELE TGERALLNLG HTFGHAFEAL
THFDNARLVH GEGVAIGMAC AFRFSVKRGH CSPEDAARVD NHLRMVGLPT RIRDIKDFDA
DATAILAAMY QDKKVERGTL TFILARAIGH CFIEKKIEGE EVKAFLEEEL MLS