Gene NATL1_02521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_02521 
SymbolmenF 
ID4779310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp232843 
End bp234243 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content34% 
IMG OID640083517 
Productisochorismate synthase 
Protein accessionYP_001014081 
Protein GI124024965 
COG category[H] Coenzyme transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1169] Isochorismate synthase 
TIGRFAM ID[TIGR00543] isochorismate synthases 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTAG AACCCGTCTT TAGTGAGCTT TTAGTTAGTT CTCTTCAGAA ATGGAGTGTA 
AGAAAAGTTG ATGAATGTAT CTTGAGCATT TCTGTTCCTG TAAGTCAAAC AGATCCTTTA
ACGACATTAC CTTTAATTGC TGAAAAACAT CAGTTTAGAT TTCTTTGGGA TCTCTCTCCC
GGATTGTGCC TTTCTGCCGG AGGTCATTGT CAATCTTTGG ATTTATCTGG ACCAAAGAGG
TTTGAAAATG CACAAAGATT TAGTGATGAG ATTTTTACTC GATTAATAGA AATTTCTCCA
AGTCCAGCAT TTTCAGCCTC AAGGATTTTA TTTTCATTTT CTTTCTTTGA TCAAATTAAA
AGTAATGAAA AATCAATAGA TGATAAATTT TCCCTACAGG CTGTTTTGCC TAAGTGGCAA
TTAACCGCAA AAGATGGATT AACATGGTTA CGTTTAAATT CAGTTGCCCA AAATCCTTCA
GATGTTCGTG AAGCCATAGA AAGATTATGG TCAATTCGAG AAAAGATAAA TAAATCACCA
GTCCGAATTG TCAATGATGA AAAAGAAAAT TTCTTAGTCG ATAATTTCTC AGATGATTGG
AAATCGCAAT ATCGAGATGC GTTGGCTAAA GGGATCGAAT TAATAAATGC AGGTGATTTG
GATAAACTTG TATTGGCTAC GAGACAACAT CTCTCCCTAA GAAAACCATT AGATCCACTG
CATTTACTTG CTCGTTTAAG AGTTCAGCAA ACTAATAGTT GTCGATTCCT ATGGCAAAAA
AATCATGATG AGTCATTTTT TGGTGCATCG CCTGAAAGAT TGATAAGTCT CAATCAAAAT
CAGTTATTAA TCGATGCTTT AGCTGGTACT GCAAAAAAAG GTGATGATGG GAGAGAATTG
CTTGCTTCCT CTAAAGATTT AAGAGAACAC CATTTTGTAG TTAATTCTAT TGTCGAACAA
CTTTTAAAAA GAGGTATTAA AGCAAGTTAT CCATCTCAAC CAAAATTAAT GACTCAAGAC
CATTTAATTC ATTTGCATAC TCTTATTCAA GCCTCTGTTA AAGAGAAATC TCCTCTTGAT
TTAGTTGAAG CACTTCATCC AACCCCTGCT GTTGCAGGAT TACCTCTGAA TAAATCTTTG
AGTTGGTTGA GAGCTTTAGA ACCATTTGAC CGTAAAACAT ATGCTTCTCC AATTGGGTGG
ATAGATAAAA ATCAAAATTC AGAATTTAGG GTTGCCATAC GTTATGGACA GCTTAAAGCT
AATGAACTTA AATTATTTGC TGGCGCTGGT TTAGTAAAAG GTTCAACTGT TGAGGGGGAA
ATGCAAGAAG TAGCTTTGAA ATTTGAAGTT TTAAGAAATC AATTAAATTT AGATAGAGTT
AATTGTTCAA ATGATCTATA A
 
Protein sequence
MNLEPVFSEL LVSSLQKWSV RKVDECILSI SVPVSQTDPL TTLPLIAEKH QFRFLWDLSP 
GLCLSAGGHC QSLDLSGPKR FENAQRFSDE IFTRLIEISP SPAFSASRIL FSFSFFDQIK
SNEKSIDDKF SLQAVLPKWQ LTAKDGLTWL RLNSVAQNPS DVREAIERLW SIREKINKSP
VRIVNDEKEN FLVDNFSDDW KSQYRDALAK GIELINAGDL DKLVLATRQH LSLRKPLDPL
HLLARLRVQQ TNSCRFLWQK NHDESFFGAS PERLISLNQN QLLIDALAGT AKKGDDGREL
LASSKDLREH HFVVNSIVEQ LLKRGIKASY PSQPKLMTQD HLIHLHTLIQ ASVKEKSPLD
LVEALHPTPA VAGLPLNKSL SWLRALEPFD RKTYASPIGW IDKNQNSEFR VAIRYGQLKA
NELKLFAGAG LVKGSTVEGE MQEVALKFEV LRNQLNLDRV NCSNDL