Gene NATL1_03541 details


Gene Information

Locus tag: NATL1_03541
Symbol:
ID: 4780061
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 326238
End bp: 327506
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 35%
IMG OID: 640083621
Product: hemolysin-like protein
Protein accession: YP_001014183
Protein GI: 124025067
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
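
The coordinates and lengths listed above are mutually consistent, which the following minimal Python sketch illustrates. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; both are assumptions that happen to match the listed values, not something the record states. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(326238, 327506)
print(length)                  # 1269
print(protein_length(length))  # 422
```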


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTAT TACTTCTTTG TATATTGCTT GTAATACCAG CATTTTTCAA TGCAGGTCAA 
TTTGCCATAT TGCGACTTCG TTCAACTAAG GTTCAAAGAC TTGTAGAAGA TGGACTACCT
GGTTCCAATT CCATAATTCG TCTGCAGAAA AGGCTGAGAA GAACCCTTTT AATAGCTGAG
TTAGGAATAA CTATTTCATT GATTTCAATT GGTTGGATCT GCAAAAATTT TGCAAGTCAA
TGGTGGGGAA ATAATGCTTC AATTAACTAT TTGTTAGATC TAGGTCTTTT TATTACTGTT
GTACTTTTAG CTACTCTGAT ATCTGGTCTG CTCCCAAAAG CACTTGTTCT TAATCAACCA
GAGACTTCTG CCCTGAAATT ATCGCCTCTA ATTGAAGCAG CGATAAAATG GATGTCTCCA
TTCCTTTCTT TGCTTGAAGG ACTTGCTTTA TTAATTCTTA GATTAGTTGG ACTTAATACT
CAATCAGAAA GTCTTACTAC AGCTGCATTC TCTGCAGGGG AATTAGAAAA ATTAATTGAA
ACTGGTGGCG TAACTGGATT AAAACCTGAT GAAAGAAACA TACTTGAAGG TGTTTTTGCT
TTAAGAGATA CACAAGTCAG AGAAGTAATG GTTCCCAGGT CTGGAATGGT TACTCTTCCA
AGAGAAGTTT CCTTCACTCA AATGATGGAA GAAGTTCATA AAACTCGTCA TGCCAGATAC
TTAGTAATTG ATGATTCACT TGATAATGTT CTTGGAGTTT TAGATTTAAG GCAACTTGCT
GATCCAATAG CTAAAGGGGC AATGCAAGCT AACTCCTCTT TGGAGCCATA TATAAAGCCA
GTTGTTCGTG TTTTAGAAAC TTCTACTTTG GCCGAATTGC TACCCCTAAT AAAAAATGGG
AATCCACTAC TTTTGGTCGT TGATGAATAT GGAGGAACTG AAGGATTAAT AACATCAGCA
GACTTAACAG GTGAAATTGT TGGTGACGAG ATTCAATTTG ATAATAAAGA ATCTGAGTTA
AGATCTCTTG ATGACTTAAA AAAAATCTGG CTTACTTCTG GCGAAATAGA AGTAATAGAA
CTAAATAGAG AGCTTAACTT AAAATTGCCA GAAGCTGATG ATCATTACAC ACTTGCTGGA
TTTGTTTTAG AAAAACTCCA AGAAATTCCA AGCTCAGGAG AAACGTTCAT TCATAATGAA
ATTGTATTTG AAATTATCTC CATGAAAGGT CCAAGAATCA ACAAAGTAAA AATAATCCTT
CCTAAGTAA
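
The gene sequence above is printed in space-separated blocks of 10 bases. As a minimal sketch (the helper names are hypothetical), the blocks can be joined and the GC fraction computed as below. Only the first line of the sequence is used here to show the method; the 35% GC content listed in the record refers to the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGTTTAT TACTTCTTTG TATATTGCTT GTAATACCAG CATTTTTCAA TGCAGGTCAA"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq), 2))
```

Running the same two functions over all lines of the gene sequence gives the gene-wide value.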
 
Protein sequence
MSLLLLCILL VIPAFFNAGQ FAILRLRSTK VQRLVEDGLP GSNSIIRLQK RLRRTLLIAE 
LGITISLISI GWICKNFASQ WWGNNASINY LLDLGLFITV VLLATLISGL LPKALVLNQP
ETSALKLSPL IEAAIKWMSP FLSLLEGLAL LILRLVGLNT QSESLTTAAF SAGELEKLIE
TGGVTGLKPD ERNILEGVFA LRDTQVREVM VPRSGMVTLP REVSFTQMME EVHKTRHARY
LVIDDSLDNV LGVLDLRQLA DPIAKGAMQA NSSLEPYIKP VVRVLETSTL AELLPLIKNG
NPLLLVVDEY GGTEGLITSA DLTGEIVGDE IQFDNKESEL RSLDDLKKIW LTSGEIEVIE
LNRELNLKLP EADDHYTLAG FVLEKLQEIP SSGETFIHNE IVFEIISMKG PRINKVKIIL
PK
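
The relationship between the gene and protein sequences can be checked with a short translation sketch. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted), so the standard amino-acid string in NCBI's TCAG codon order is used. The `translate` helper is hypothetical and covers only unambiguous A/C/G/T codons.

```python
from itertools import product

# Codon order follows the NCBI genetic-code listing: bases T, C, A, G,
# with the first base varying slowest and the third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGAGTTTAT TACTTCTTTG TATATTGCTT"))  # MSLLLLCILL
print(translate("CCTAAGTAA"))                         # PK*
```

The two outputs match the first ten residues and the final two residues of the protein sequence, with the trailing `*` corresponding to the TAA stop codon.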