Gene NATL1_07361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_07361 
Symbol 
ID4781250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp677452 
End bp678756 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content32% 
IMG OID640084011 
Productcarboxyl-terminal processing protease 
Protein accessionYP_001014559 
Protein GI124025443 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00775451 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTGAAGA GATTTTTAAA AATATGTTTA CTTATAGCTA TTTGCCCTTC ACCTTCTTTC 
TCTTTCCAGG CTAATTCCTC TACTTTGATT ACTAACAATC CAAAGGAGAT TATTGATCAG
GTATGGCAAA TTATATATCG CGACTTTTTG GACTATTCAG GAAAATATAA GGCAGAAGAT
TGGATTAAAT TAAGAAAGGA AATACTATCA ACCAAATATT TTGATAATGA CGAAGCATAC
ATTGCCATTA AAGATATGTT GACAGAATTA GATGATCCTT ATACCAGATT TTTAGATCCT
AAAGAATTCA ATGAAATGAG AATAGATACA ACGGGAGAAT TGATGGGAGT AGGTATTCAA
ATTTCTCTAG ATGAAGTTAG TAATCAAATT GTTGTCGTAT CACCAATAGA GGGAACCCCA
GCCTTTCTCG CGGGAATTAA ACCGAAGGAT ATAATTGTAT CTATAGATGG TAAGGCTATC
GATGGTCTGA GCATAGACAG TACTGTTAAA CTTATTCGAG GTAAAAAAGG AACGAAGGTT
GAACTAGGTA TTATTAGAGA CGAGGAGTTA TTAAATATCT CATTAATACG AGATAGAATT
GAAATTAATG TTGTTGATAG TCGAATAAAT AATACAGTTT CAGGCGCGAA AATTGGTTAT
GTAAGGTTAA AACAATTTAA TGCCAAATCT CCAAAAGAAA TGAGTTTATC TATTAATAAA
TTAGAAAAAC AACAACCTTT TGGTTATGTA TTAGACCTTA GAAGTAATCC TGGTGGTTTG
CTTGAGGCAA GTATTGAAAT AGCGAGACAA TGGATAAATA CAGGAATTAT TGTTAGTACT
AAAACAAAAG ATGGTATTAC TGATATTCGG AAAGCAAAAA GTAGAGCCTT AACTAACAGA
CCAGTTGTGG TTTTAATCGA TGAAGGATCT GCGAGTGCTA GTGAGATTCT CTCTGGAGCA
ATTAAAGATA ATAAAAGAGG AGTATTGGTT GGGAAAAAGA CTTTTGGTAA AGGACTAGTT
CAGTCTGTCA GGTCTCTTTC TGATGGCTCA GGTCTAACAG TTACGGTCGC AAAATATTTA
ACACCAAGTG GTAAAGATAT TAATAAAAAT GGAATAGCAC CAGATATCAG AGCAGATCTT
TTGTTAAATG AAAAAAACAA ATTAACAAAT GCAGATCTAG GAACTTTAAA AGATAGTCAA
TATGTTGCGG CTGAAAATAT ATTACTTAAA AAGTTTAAAA TTGAGAGTAA TAAAAATTCT
TATAATCCCT TGAAATCAAA TTTGGGCTAT GCCCTAAAAA ACTAG
 
Protein sequence
MLKRFLKICL LIAICPSPSF SFQANSSTLI TNNPKEIIDQ VWQIIYRDFL DYSGKYKAED 
WIKLRKEILS TKYFDNDEAY IAIKDMLTEL DDPYTRFLDP KEFNEMRIDT TGELMGVGIQ
ISLDEVSNQI VVVSPIEGTP AFLAGIKPKD IIVSIDGKAI DGLSIDSTVK LIRGKKGTKV
ELGIIRDEEL LNISLIRDRI EINVVDSRIN NTVSGAKIGY VRLKQFNAKS PKEMSLSINK
LEKQQPFGYV LDLRSNPGGL LEASIEIARQ WINTGIIVST KTKDGITDIR KAKSRALTNR
PVVVLIDEGS ASASEILSGA IKDNKRGVLV GKKTFGKGLV QSVRSLSDGS GLTVTVAKYL
TPSGKDINKN GIAPDIRADL LLNEKNKLTN ADLGTLKDSQ YVAAENILLK KFKIESNKNS
YNPLKSNLGY ALKN