Gene NATL1_06701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_06701 
SymbolaroA 
ID4781288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp616159 
End bp617493 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content40% 
IMG OID640083946 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001014495 
Protein GI229000869 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.36733 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAGCTC CCACTAAAGA TCAGTCTTTA AGAAACCTTC AAAAAGGAGG GGAGCTCTAT 
GGAAAAGTGA AAGTACCTGG AGACAAGTCA ATCTCACACC GTGCACTACT TTTTGGGGCT
ATTGCTAAGG GGAAAACACT AATTGAGGGC CTTTTACCTG CTGAAGATCC ATTAAGTACT
GCTGAATGCC TTAGGTCAAT GGGCGTAAAG ATTAGTCCAA TCAAGAAAGG AGACATTATT
GAAATTGAAG GCGTTGGATT AAATGGCCTC CAGGAGCCAC AAGATATTTT GAACTGCGGA
AATTCAGGAA CAACTATGAG ATTAATAATG GGATTATTAG CCGGTCAAAA AGATCATCAT
TTCATCCTCA CAGGCGATAA ATCACTTAGA AATAGGCCGA TGAAAAGAGT AGGACAGCCG
TTAAAAATGA TGGGGGCTAA AGTTTTCGGA AGATGCGGTG GAGACTTGGC TCCTCTATCG
ATTATTGGGA ATAAATTAAG AGGTGCCGTA ATTGGTACAC CAGTAGCAAG TGCTCAGATA
AAATCTGCAA TCCTACTAGC TGCTCTTAAT GCAGAAGGCT CAACGACGGT TATTGAACCC
GCCAGATCAA GAGATCATAG CGAGAGAATG CTAAAAGCCT TCGGAGCTAA TCTAGAGGTT
GGTGGAGAGA TGGGTAGACA TATAACTGTA TCCCCTGGTA AAGATCTAAA AGGTCAATCA
ATTATTGTTC CTGGAGATAT TAGTTCCGCT GCATTTTGGC TCGTTGCAGG TAGCATCATA
CCCGGATCAG AGTTGGTTGT AGAAAATGTT GGTCTAAATC CAACAAGGAC TGGAATACTT
GACGTATTAG AAGCAATGGA AGCAAATATC AACGTAATAA ACAAAAGAGA TGTAGCCGGT
GAACCTGTCG GAGATATTGA AGTTTTCTAC AAAGAAAACT TAAAACCATT TAAAATTGAC
GATGAGATAA TGCCACGGCT TGTTGACGAG ATACCCATTT TATCCGTAGG AGCATGTTTT
TGTAATGGTA TCAGTCAAAT AAAAGGAGCA AGTGAGCTAA GAGTTAAAGA AACTGATCGA
TTAGCTGTAA TGGCAAGGCA ATTAAAAAGG ATGGGAGCCA GCGTAGATGA GCATCAAGAT
GGTCTAACTA TCTATGGAGG AAAAAGCTTA GAAGGATGCG AACTTGATAG CGAGGATGAT
CACCGTATAG CCATGAGTTT AGCTATTGCA TCAATAATGG CTAATTCTAA TTCGACATTA
CGACGTAGTG AGGCTGCAGC AATTTCATAT CCTGATTTTT GGAGTGATCT TAAGAGACTT
CAACAAAAAA ATTAG
 
Protein sequence
MKAPTKDQSL RNLQKGGELY GKVKVPGDKS ISHRALLFGA IAKGKTLIEG LLPAEDPLST 
AECLRSMGVK ISPIKKGDII EIEGVGLNGL QEPQDILNCG NSGTTMRLIM GLLAGQKDHH
FILTGDKSLR NRPMKRVGQP LKMMGAKVFG RCGGDLAPLS IIGNKLRGAV IGTPVASAQI
KSAILLAALN AEGSTTVIEP ARSRDHSERM LKAFGANLEV GGEMGRHITV SPGKDLKGQS
IIVPGDISSA AFWLVAGSII PGSELVVENV GLNPTRTGIL DVLEAMEANI NVINKRDVAG
EPVGDIEVFY KENLKPFKID DEIMPRLVDE IPILSVGACF CNGISQIKGA SELRVKETDR
LAVMARQLKR MGASVDEHQD GLTIYGGKSL EGCELDSEDD HRIAMSLAIA SIMANSNSTL
RRSEAAAISY PDFWSDLKRL QQKN