Gene NATL1_07421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_07421 
SymbolaroB 
ID4780370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp683270 
End bp684376 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content36% 
IMG OID640084017 
Product3-dehydroquinate synthase 
Protein accessionYP_001014565 
Protein GI124025449 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00250573 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAATAAGG ACAACCACCA TATTAAAGTC TCTCTAACTA ACAACCCATA CGAAATAGTT 
ATTGGAAAAA ATAGTCTTGA AAGCATAGGA GATGAGTTAT TTAATATTGG TTTTAGGGAA
GGACTAAAAG TATTAGTCGT GTCCAACAAG GAAGTCTCTG ATCACTATGG TGATTGCATA
ATCAAAAGTC TGATAAAAAG TAAATTCAAA CCAAAGCTTT TGATAATAAA AGCTGGAGAA
GATCAAAAAA ATCAATCTTC TATAGATTTA ATCCACAATG CGGCATATGA AGCGAGGTTA
GAGAGAGGAT CATTAATGAT TGCCCTTGGA GGTGGCGTGA TTGGAGATAT GACTGGCTTT
GCAGCTGCTA CATGGTTGCG TGGAGTAAAT GTAGTCCAAA TCCCCACCAC ATTACTCGCC
ATGGTTGATG CTTCTATTGG TGGTAAAACA GGGATAAATC ATTCAAAAGG TAAAAATCTT
ATAGGTGCTT TTCATCAACC TAGACTGGTC TTAATAGACC CTAAAACATT AATTTCTCTC
CCATCACGAG AGTTCAAAGC AGGTATGGCT GAAATAATAA AGTACGGAGT TATATCAGAC
TTAGAACTAT TCGATCTTCT TGAAAGGCAA GAAAATATTG CTGATCTTTC AAACATAAAA
GAAAAACTAC TATTAGAAAT AATTAAGCGT TCTGCTAAAT CTAAAGCAGA AATTGTTATA
AAAGATGAGA AGGAAAGTGG AGTTAGAGCA TTTTTAAATT ATGGTCACAC ATTTGGCCAC
GTAATAGAAA ATCTTTGTGG TTATGGAAAA TGGCTGCATG GCGAGGCAGT TGCAATGGGT
ATGGTTGCAG TTGGTCAGTT AGCGGTTCAG AGGGGACTAT GGAACGAGGA TAACGCGAAA
AGGCAGAAAC GATTAATAGA GAAAGCAGGC TTACCCTCTA ATTGGCCTAA GCTTGATATA
GAAAGTGTTC TAAGCTCACT TCAAGGAGAC AAGAAAGTTA AGAACGGCAA GGTGAGTTTC
GTTATGCCCT TAAAAATTGG TGATGTAAAA TTATTTAATA ATATTTCTAA TAAAGAAATA
CGTGAATGCT TGCAAAAAAT TAGCTAA
 
Protein sequence
MNKDNHHIKV SLTNNPYEIV IGKNSLESIG DELFNIGFRE GLKVLVVSNK EVSDHYGDCI 
IKSLIKSKFK PKLLIIKAGE DQKNQSSIDL IHNAAYEARL ERGSLMIALG GGVIGDMTGF
AAATWLRGVN VVQIPTTLLA MVDASIGGKT GINHSKGKNL IGAFHQPRLV LIDPKTLISL
PSREFKAGMA EIIKYGVISD LELFDLLERQ ENIADLSNIK EKLLLEIIKR SAKSKAEIVI
KDEKESGVRA FLNYGHTFGH VIENLCGYGK WLHGEAVAMG MVAVGQLAVQ RGLWNEDNAK
RQKRLIEKAG LPSNWPKLDI ESVLSSLQGD KKVKNGKVSF VMPLKIGDVK LFNNISNKEI
RECLQKIS