Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_07421 |
Symbol | aroB |
ID | 4780370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 683270 |
End bp | 684376 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640084017 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001014565 |
Protein GI | 124025449 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00250573 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGAATAAGG ACAACCACCA TATTAAAGTC TCTCTAACTA ACAACCCATA CGAAATAGTT ATTGGAAAAA ATAGTCTTGA AAGCATAGGA GATGAGTTAT TTAATATTGG TTTTAGGGAA GGACTAAAAG TATTAGTCGT GTCCAACAAG GAAGTCTCTG ATCACTATGG TGATTGCATA ATCAAAAGTC TGATAAAAAG TAAATTCAAA CCAAAGCTTT TGATAATAAA AGCTGGAGAA GATCAAAAAA ATCAATCTTC TATAGATTTA ATCCACAATG CGGCATATGA AGCGAGGTTA GAGAGAGGAT CATTAATGAT TGCCCTTGGA GGTGGCGTGA TTGGAGATAT GACTGGCTTT GCAGCTGCTA CATGGTTGCG TGGAGTAAAT GTAGTCCAAA TCCCCACCAC ATTACTCGCC ATGGTTGATG CTTCTATTGG TGGTAAAACA GGGATAAATC ATTCAAAAGG TAAAAATCTT ATAGGTGCTT TTCATCAACC TAGACTGGTC TTAATAGACC CTAAAACATT AATTTCTCTC CCATCACGAG AGTTCAAAGC AGGTATGGCT GAAATAATAA AGTACGGAGT TATATCAGAC TTAGAACTAT TCGATCTTCT TGAAAGGCAA GAAAATATTG CTGATCTTTC AAACATAAAA GAAAAACTAC TATTAGAAAT AATTAAGCGT TCTGCTAAAT CTAAAGCAGA AATTGTTATA AAAGATGAGA AGGAAAGTGG AGTTAGAGCA TTTTTAAATT ATGGTCACAC ATTTGGCCAC GTAATAGAAA ATCTTTGTGG TTATGGAAAA TGGCTGCATG GCGAGGCAGT TGCAATGGGT ATGGTTGCAG TTGGTCAGTT AGCGGTTCAG AGGGGACTAT GGAACGAGGA TAACGCGAAA AGGCAGAAAC GATTAATAGA GAAAGCAGGC TTACCCTCTA ATTGGCCTAA GCTTGATATA GAAAGTGTTC TAAGCTCACT TCAAGGAGAC AAGAAAGTTA AGAACGGCAA GGTGAGTTTC GTTATGCCCT TAAAAATTGG TGATGTAAAA TTATTTAATA ATATTTCTAA TAAAGAAATA CGTGAATGCT TGCAAAAAAT TAGCTAA
|
Protein sequence | MNKDNHHIKV SLTNNPYEIV IGKNSLESIG DELFNIGFRE GLKVLVVSNK EVSDHYGDCI IKSLIKSKFK PKLLIIKAGE DQKNQSSIDL IHNAAYEARL ERGSLMIALG GGVIGDMTGF AAATWLRGVN VVQIPTTLLA MVDASIGGKT GINHSKGKNL IGAFHQPRLV LIDPKTLISL PSREFKAGMA EIIKYGVISD LELFDLLERQ ENIADLSNIK EKLLLEIIKR SAKSKAEIVI KDEKESGVRA FLNYGHTFGH VIENLCGYGK WLHGEAVAMG MVAVGQLAVQ RGLWNEDNAK RQKRLIEKAG LPSNWPKLDI ESVLSSLQGD KKVKNGKVSF VMPLKIGDVK LFNNISNKEI RECLQKIS
|
| |