Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewana3_3893 |
Symbol | aroB |
ID | 4480104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. ANA-3 |
Kingdom | Bacteria |
Replicon accession | NC_008577 |
Strand | - |
Start bp | 4671626 |
End bp | 4672705 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 639728506 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_871517 |
Protein GI | 117922325 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000115633 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.386529 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAAC AAATTCAGGT TGATTTAGGT GAACGTAGTT ATCCCATTTA TATTGGCCAG AGTTTGATGA GTGATGGCGA GACCCTGTCT CGCTACCTGC TGAAAAAACG TATCCTTATC GTCACCAATG AAACTGTCGC GCCCTTGTAT CTTAAGCAGA TCCAAGACAC GATGGCTTCG TTTGGTGAGG TAACCAGCGT CATCCTTCCC GATGGCGAGC AATTCAAAGA TTTAACGCAT TTAGATTCCA TTTTTACGGC TTTACTGCAA CGCAATTATG GCCGTGATTC AGTGCTGGTG GCCCTCGGTG GCGGGGTGAT TGGTGACATG ACGGGTTTTG CCGCTGCCTG TTACCAACGT GGCGTCGATT TTATTCAAAT TCCGACCACA CTACTATCAC AAGTAGACTC TTCCGTTGGC GGAAAAACCG CCGTTAATCA TCCGCTTGGC AAAAATATGA TCGGGGCTTT TTACCAGCCA CAGATCGTCA TTATCGATAC TGAATGCTTG CAGACCTTGC CCGCGCGTGA ATTTGCCGCT GGGATGGCAG AAGTCATTAA GTATGGCATC ATGTGGGATG CTGAATTTTT TCAATGGCTT GAGAACAATG TTCAAGCATT GAAAAGCCTA GATACTCAAG CTTTGGTCTA TGCGATTTCT CGCTGCTGTG AGATTAAAGC CGATGTCGTG AGTCAGGATG AGACCGAGCA GGGCGTCCGC GCGCTATTAA ACCTTGGGCA TACCTTTGGT CATGCGATCG AGGCCGAGAT GGGTTATGGC AATTGGTTGC ATGGTGAAGC GGTTGCTGCT GGCACAGTCC TTGCTGCACA AACGGCTAAA TCCATGGGAT TGATTGATGA GTCAATTGTT CGTCGTATTG TGCAATTGTT CCATGCTTTC GATCTGCCAG TAACAGCGCC GGAATCTATG GATTTCGATA GTTTTATTAA ACACATGCGT CGCGATAAGA AAGTGTTAGG TGGTCAGATC CGACTGGTAC TCCCGACGGC CATTGGTCGA GCTGATGTCT TTAGCCAAGT TCCGGAATCT ACCCTAGAAC AGGTTATCTG CTGCGCATAA
|
Protein sequence | MTKQIQVDLG ERSYPIYIGQ SLMSDGETLS RYLLKKRILI VTNETVAPLY LKQIQDTMAS FGEVTSVILP DGEQFKDLTH LDSIFTALLQ RNYGRDSVLV ALGGGVIGDM TGFAAACYQR GVDFIQIPTT LLSQVDSSVG GKTAVNHPLG KNMIGAFYQP QIVIIDTECL QTLPAREFAA GMAEVIKYGI MWDAEFFQWL ENNVQALKSL DTQALVYAIS RCCEIKADVV SQDETEQGVR ALLNLGHTFG HAIEAEMGYG NWLHGEAVAA GTVLAAQTAK SMGLIDESIV RRIVQLFHAF DLPVTAPESM DFDSFIKHMR RDKKVLGGQI RLVLPTAIGR ADVFSQVPES TLEQVICCA
|
| |