Gene NATL1_16631 details


Gene Information

Locus tag: NATL1_16631
Symbol:
ID: 4779751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1355013
End bp: 1356098
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 36%
IMG OID: 640084946
Product: membrane-associated Zn-dependent proteases 1
Protein accession: YP_001015485
Protein GI: 124026369
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0750] Predicted membrane-associated Zn-dependent proteases 1
TIGRFAM ID: [TIGR00054] RIP metalloprotease RseP
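
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1355013
END_BP = 1356098
GENE_LENGTH_BP = 1086
PROTEIN_LENGTH_AA = 361


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for the values listed in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

A mismatch in either check would suggest a partial gene, a frameshift, or a different coordinate convention rather than a plain complete CDS.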


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.24545
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGTTC TCTTATCTAT AGCTGTACTT GGCCTTCTGA TTTTTTTTCA TGAATCTGGT 
CATTTTTTAG CAGCAGTACT TCAAAAAATT AAAGTCAGTG GATTTTCAAT TGGTTTCGGA
CCAGCTCTTT TGAAAAAGGA AATAAATGGG ATTACTTATT CACTTAGATC GCTTCCTTTA
GGTGGATTCG TTTCCTTTCC TGATGAAGAA ACTGATTCAC TAGTTCAACC TAATGACCCA
GATCTTTTAA AGAATAGACC AATTCACCAA AGAGCAATAG TTATTTCAGC GGGTGTCATA
GCAAATTTAT TACTTGCTTG GATCGTACTT ATTGGTCAAG CAAGCTTTGT AGGAATTCCT
AATCAACCTG AGCCAGGAGT AATAATCATG GGAATCCAAC CAGATGAGCC TGCATTTAAT
TCTGGATTAG TGGCTGGAGA TCGAATAATG AGCGTAAACG GGAAAGAATT AGGAAGCGGT
AAGGAGGGAA TTATGAATTT AGTCAATATC ATTCAAAATT CATCGGGGGA AGAATTACTT
TTTGAGAGAG TTAATGAAGA AGCAAACGAT ACAGTTTCTA TAATTCCAGC TGAAAACGAA
GGAAATGGGA GGATAGGAGC TCAATTGCAA CCAAATCTTA CTAATGAAGT ATCAAAAGCA
AAAAATATTG GAGAAATATT TAATAGCTCG AATTCACAAT TTTATGAATT ACTAAGTCGA
ACAGTTATTG GCTATAAAAG CTTGATTACT AATTTCTCTT CAACGGCTCA GCAGTTAAGT
GGTCCAGTCA AAATTGTTGA AATTGGAGCT CAGCTCTCAG AGCAAGGGGG CTCAGGTCTT
ATACTATTTT CTGCTTTGGT TTCAATTAAC CTTGCAGTTC TTAACTCGTT ACCGTTGCCA
CTTCTAGATG GAGGACAACT TGTACTTCTA ATTCTAGAAA GTATCAGAGG GAAGCCTGTT
CCTGAAAAAA TTCAATTAGC TTTTATGCAA TCAGGATTTG TTTTACTTGT AGGACTAAGT
GTTGTTTTGA TAATCCGAGA TACTACTCAG CTAGCTTTAG TTCAACAGAT TGTTCACAGA
CAATAA
 
Protein sequence
MNVLLSIAVL GLLIFFHESG HFLAAVLQKI KVSGFSIGFG PALLKKEING ITYSLRSLPL 
GGFVSFPDEE TDSLVQPNDP DLLKNRPIHQ RAIVISAGVI ANLLLAWIVL IGQASFVGIP
NQPEPGVIIM GIQPDEPAFN SGLVAGDRIM SVNGKELGSG KEGIMNLVNI IQNSSGEELL
FERVNEEAND TVSIIPAENE GNGRIGAQLQ PNLTNEVSKA KNIGEIFNSS NSQFYELLSR
TVIGYKSLIT NFSSTAQQLS GPVKIVEIGA QLSEQGGSGL ILFSALVSIN LAVLNSLPLP
LLDGGQLVLL ILESIRGKPV PEKIQLAFMQ SGFVLLVGLS VVLIIRDTTQ LALVQQIVHR
Q
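
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the codon assignments of NCBI translation table 11, which for elongation codons match the standard genetic code, and it strips the 10-base spacing used in the listing before translating. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and the example runs only on the first and last blocks of the gene sequence rather than on the full record.

```python
from itertools import product

# Codon table in the compact NCBI layout: bases ordered T, C, A, G,
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    first_blocks = "ATGAACGTTC TCTTATCTAT AGCTGTACTT"
    print(translate(first_blocks))  # first ten residues of the protein
    print(translate("CAATAA"))      # final codon followed by the stop codon
    print(round(gc_fraction(first_blocks), 2))
```

Passing the complete gene sequence to `translate` and `gc_fraction` is one way to compare the result with the listed protein sequence and GC content.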