Gene NATL1_00331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_00331 
SymboldhsS 
ID4780297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp32708 
End bp33862 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content35% 
IMG OID640083296 
Productsoluble hydrogenase small subunit 
Protein accessionYP_001013862 
Protein GI124024746 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0075] Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.417582 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGACA AACTCAATTT AATGATTCCT GGACCAACAC CGGTCCCAGA AAATGTTTTG 
AGTTCTATGA GTAAACACCC CATTGGTCAT AGAAGTGGAG ATTTTCAAAA AATTGTTCAA
AAAACAACTG AACAACTCAA ATGGCTTCAC CAAACAACTG CAGACGTCCT AACAATTACA
GGAAGTGGAA CAGCTGCAAT GGAGGCAGGA ATAATTAATA CATTAAGCAA AGGTGATCAA
GTCATTTGCG GCGACAATGG AAAATTTGGT GAAAGATGGG TAAAAGTAGC AAGGGCATAT
GGATTAGATG TAAAAGTTGT AAAAGCTGAT TGGGGAACCC CTCTTGATCC AAATCAATTT
AAGAGGATTC TTGAAGAAGA CACCAATGAA AAAATTAAAG CGGTTATTTT AACTCATTCA
GAAACTTCAA CAGGAGTGAT TAATGATCTT AAATCGATTA ATAACGAAGT AAAAAATCAT
AGTAAAGCTA TTACAATTGC GGATTGTGTA ACAAGTCTTG GTGCATGTAA CATCCCAATG
GATGAATGGG GAATTGATGT AATAGCTTCA GGCTCTCAAA AAGGTTATAT GATTCCACCT
GGCCTGAGTT TTGTTGCTAT GAGCAAAAGA GCATGGGAAG CAAATAATCA ATCAAATTTA
CCTAAGTTTT ACTTAGATCT AAAACAATAT TTAAAGACAG TTAATCAAAA TAGTAATCCT
TTTACGCCTG CAATAAATTT ATACTTTGCT TTAGAAGCTT CACTAACAAT GATGCAAAAA
GAAGGGTTAA ATAATATATT TGCCCGCCAT GCTCGTCATC AAAAAGCAAC GCAAGAAGGA
ATAAAAGCAA TGGGTTTGAA TTTATTTACA AAAGAAAATT TTGGAAGTCC AGCAATAACA
GCTGTTAAGC CTGAAAATAT TGACGCTGAA AGTATAAGAA AGGCAATAAA AAATGACTTC
GACATACTCC TTGCTGGAGG TCAAGATCAT TTAAAAGGAA AAATCTTTAG AATTGGACAT
TTAGGATTTG TCAATAATAG AGACATTATT AGTGTCATAT CAGCTTTAGA AAGCACTCTT
GATAAAATGG GCAAACTAAA CGTCCCCATT GGCCAAGGAA TTGCAAAAAC AATTTCAGTA
CTAAATAACG AATAA
 
Protein sequence
MQDKLNLMIP GPTPVPENVL SSMSKHPIGH RSGDFQKIVQ KTTEQLKWLH QTTADVLTIT 
GSGTAAMEAG IINTLSKGDQ VICGDNGKFG ERWVKVARAY GLDVKVVKAD WGTPLDPNQF
KRILEEDTNE KIKAVILTHS ETSTGVINDL KSINNEVKNH SKAITIADCV TSLGACNIPM
DEWGIDVIAS GSQKGYMIPP GLSFVAMSKR AWEANNQSNL PKFYLDLKQY LKTVNQNSNP
FTPAINLYFA LEASLTMMQK EGLNNIFARH ARHQKATQEG IKAMGLNLFT KENFGSPAIT
AVKPENIDAE SIRKAIKNDF DILLAGGQDH LKGKIFRIGH LGFVNNRDII SVISALESTL
DKMGKLNVPI GQGIAKTISV LNNE