Gene NATL1_06211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_06211 
Symbol 
ID4779544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp564292 
End bp565533 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content36% 
IMG OID640083898 
Productglutathione S-transferase 
Protein accessionYP_001014448 
Protein GI124025332 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0625] Glutathione S-transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.385137 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.975907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCAA CCAGAAATGC AATGAGTTGG GAAGAATTAG CAATATATGC AGCCGATCCA 
ATTGATCAAA TCAATGGATT AAACAACCCA TATTCCACAC TTCGTCTTTT TAATAAAAGT
GAGTCTCAGG CAATTGTTAC TCTTTACAGA GACAATCATG CTTGGTGTCC TTATTGCCAA
AAAGTTTGGA TTTGGTTAGA GCTTAAAAAT ATTCCTTACC GGATTAAAAA AGTTACAATG
CGTTGCTACG GAGAAAAAGA AAAATGGTAT TTAAAAAAAG TGCCCTCAGG AATGCTTCCT
GCAATAGAAA TTGAAAATCA TGTAATCACA GAAAGTGATG AGATCCTATT TGTGTTAGAA
GAAATTTATG GTCCTTTAGG TCAATCTCTA AATGAGAATA AAGTCTTAGA GCATAGGAGA
CTTGAAAGAG AACTATTTTC ATCATGGTGT AACTGGTTAT GTCGCAACTC TCTTTTCCAA
GCCCAAGAAG AACAAAAGAA AGAGAATTTC AAAAATGTTG CAAATAAATT TGAAAAAGAA
CTACAAAAAA ATGCTTCAGG GTGGCTAACA CCTGTCTCAA CAAAGAATGG CGAAAAGCCT
GGATCAGCAG ACGTAATTTT CATTCCATAT GTTGAAAGAA TGAATGCCTC ATTGGCTTAC
TATAAAGGAT ATTCATTGAG AGATGAGCAC CCTATTATAA ATACATGGCT TAAAAACCTT
GAAAAACTTG AGGAGTATAG AGGGACTCAA GGAGATTTTC ATACTCATGC TCATGATTTA
CCTCCTCAAA TGGGAGGTTG CTTTACTTAT TCAAATAAAA ACCAACAATT ATTTGCCCAA
GAAATTGATA TAGGTTCTGG GTTAGGACAA TTAGAACTGG TAGATTTCAA AGTTGACCAA
AAATCGGAAC AACATTTTGA GGCATTAGCT TTAGAAAGAG TGATCAGACA TAAAGAAAGG
ATCGTTTCTG TAAGCCCAAT GAAAAATAAA TTATTTGATC AGCCATTGAG AGCAGCCTTA
ACTTCTATGA TTTCAAAAAA AGATTGCCTT CCAGAGCAGA ATAGTGCCTC TGCATTACGT
TACCTCCGAG ATAGAATTTC AGTGCCGAGA GATATGCCTT TATTATCAGG GAGATTATTT
AGACAAGCTC TTGAACGCAC AGCAAATATT GATGGTAGCG ATCCAGGTCC TGAAATACCT
ACACGAAATA GGCTAGATCA AAATCCCATA CAGTTTAATT AA
 
Protein sequence
MNPTRNAMSW EELAIYAADP IDQINGLNNP YSTLRLFNKS ESQAIVTLYR DNHAWCPYCQ 
KVWIWLELKN IPYRIKKVTM RCYGEKEKWY LKKVPSGMLP AIEIENHVIT ESDEILFVLE
EIYGPLGQSL NENKVLEHRR LERELFSSWC NWLCRNSLFQ AQEEQKKENF KNVANKFEKE
LQKNASGWLT PVSTKNGEKP GSADVIFIPY VERMNASLAY YKGYSLRDEH PIINTWLKNL
EKLEEYRGTQ GDFHTHAHDL PPQMGGCFTY SNKNQQLFAQ EIDIGSGLGQ LELVDFKVDQ
KSEQHFEALA LERVIRHKER IVSVSPMKNK LFDQPLRAAL TSMISKKDCL PEQNSASALR
YLRDRISVPR DMPLLSGRLF RQALERTANI DGSDPGPEIP TRNRLDQNPI QFN