Gene NATL1_02531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_02531 
SymbolgshB 
ID4779433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp234221 
End bp235150 
Gene Length930 bp 
Protein Length309 aa 
Translation table11 
GC content39% 
IMG OID640083518 
Productglutathione synthetase 
Protein accessionYP_001014082 
Protein GI124024966 
COG category[H] Coenzyme transport and metabolism
[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0189] Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) 
TIGRFAM ID[TIGR01380] glutathione synthetase, prokaryotic 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAC TATTTGTTTT AGATCCAATT GAAAATATCA ATCCTAAGAA GGATTCATCA 
GCAGCACTTA TGCAAGCCGC ATCAAGAGCC AACATAGATG TTTGGATCTG TACTCCCTCG
GACCTGCAAG CCCGAGGAGA CGATGCATGG GTCGTTTCTA ACAAGGTCAA TTGTGAACCA
TGGATCAATG TCCAGTCACC TCGAAGCCTT CCTTTAAGAG ATTTCTCATG CATTTGGATG
CGCAAAGATC CACCTGTTGA CGAGGCTTTT TTATACGCCA CTCATTTATT AGAAGTTGCA
GAAAGAGATG GTGTCAATGT AATTAACAAG CCTGCATCAC TTAGAGCTTG GAATGAAAAG
TTAGGAGCTT TAAGATTTAG CGATTTAATG GCTCCCACTC TTGTCGCAAG TAGGGTGGAA
CAATTAATTA CATTTGCAAA AGAGTATGGA GAAGTTGTAT TAAAACCACT TGGAGGGAAA
GGTGGGCAAG GAGTCATACG AATTGCAAAG GATGCTCCAG GCTTAGAAGC ATTACTCGAA
CTGGTTACTT CACAAGAACA TTTGCCAGTG ATGATGCAAC AATTCCTACC AGAAGTAATC
AATGGTGATA AAAGAATCCT TTTAGTTAAT GGAGAGCCAT TAGGTGCAAT TAATAGACGT
CCAAAGGAAG GAGACTTCAG AAGCAACTTG GCTTTAGGTG GAAAAGCAGA GACAACTAAA
TTAACTCCTA AAGAAATAGA GATATGTAAT CAAATAAAGC CCGCTTTACA AGAGGAAGGT
CTCTTTTTTG TTGGAATAGA TGTGATCGGA GGAATGCTAA GTGAAATTAA TGTGACGAGT
CCAACTGGTA TTAGAGAAGT AGAGAACCTA ATGAATGTGC CATTAGCAGA TCAAGTAATT
GATTACCTTA TAGATCATTT GAACAATTAA
 
Protein sequence
MKQLFVLDPI ENINPKKDSS AALMQAASRA NIDVWICTPS DLQARGDDAW VVSNKVNCEP 
WINVQSPRSL PLRDFSCIWM RKDPPVDEAF LYATHLLEVA ERDGVNVINK PASLRAWNEK
LGALRFSDLM APTLVASRVE QLITFAKEYG EVVLKPLGGK GGQGVIRIAK DAPGLEALLE
LVTSQEHLPV MMQQFLPEVI NGDKRILLVN GEPLGAINRR PKEGDFRSNL ALGGKAETTK
LTPKEIEICN QIKPALQEEG LFFVGIDVIG GMLSEINVTS PTGIREVENL MNVPLADQVI
DYLIDHLNN