Gene NATL1_05931 details


Gene Information

Locus tag: NATL1_05931
Symbol:
ID: 4779892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 538982
End bp: 540199
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 31%
IMG OID: 640083870
Product: Zn-dependent proteases
Protein accession: YP_001014420
Protein GI: 124025304
COG category: [R] General function prediction only
COG ID: [COG1994] Zn-dependent proteases
TIGRFAM ID:
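
The length fields above follow from the coordinates, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(538982, 540199)
print(length, protein_length_aa(length))  # 1218 405
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.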


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.111882
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCA GAGGTATCCC TTTGAGGATC CATCCAAGTT GGTTTTTGGT TTACTTGTAT 
TTTACTTTGT CATCTAAAGA TCAGTTCGAG ACGCTTTTGA ATGGTCAAGC AACTATTTGG
AATGGATGGG TGATTGGTGC TTTTACCTCT TCTCTTTTGT TTTTATCTGT TTTATTGCAT
GAATTGGCTC ATTCTTTTGT AGCAATTGGA GAGGGTCTAA AAGTTAGAGA CATAACACTT
TTTTTTCTTG GAGGTATGGC AAGTCTTGAA AAGGAATGTC CGACTTCAAA AGGAAGTTTA
AAAATTGCCA TTTCAGGTCC TGTTGTTAGT CTTTTATTAG CTTTTTTAAT GATTTTATTA
AGTAATAATT TATCAGTATC GAATTTTATT CTCTCTAATT TATTTAAGCA GGTTGGAAGC
CTCAACCTTT TAATAGGTGT ATTTAATTTA CTTCCGATAA TTCCTCTAGA TGGTGGCGTA
ATATTAAAAT CTTTAATTTG GTACTTTACA GGGAGTAAAA GAGCAGGGAT TAAAGTTGCT
ATTGGCTCTG CAAGATTAAT TTCTTTTCTT GCTATTTTTA TTGGCTTTTT AAGTTTGGTT
AGGGGTAACT TATATCTTGC CATTTGCTTT TCTATTATTG GTTTATTTGT TTTTTCTTCA
TCTAAATCAC AGAGCCAAAT TATTCAAATA CAAAAGATAT TATCTGAATC ATATGTTAAT
CAGGTTTGTA GTCGTTCATT TAGGGTTCTA GAGGATGATT TGCCTGTGAA AGTTTTATCT
AAATATAGTT CATTTAATAA AGATAATTTT TTCAATGAAG TATGGATCCT TTTGTGTAGA
GAAGGGAGAT GGGTCGGTTA TGTGAATGAA AAAATCTTGA AGAATATTTC TGTACAAAAC
TGGGATAAAA AATTTCTTTA TGAATTCTCA CAACCAATAA ATGAATTGCC ATCTATTAGT
GAAAAAGAAT CATTATGGAA AGCAATATTA AAAATAGAAA AAACAAAAGA TGGAAGGCTA
CTTGTACTAT CATTTTCTGG TCTTCCTCTT GGAACTTTAG ATAGAGTAGA TATAGGTAAA
GCAGTACTTA AAAAAATCGG ATTAAACCTT CCAGACCAAT TAATTAAAAT TGCAAGAAAA
GATAATATTT ATCCACTAGG ATTAAATCTA CTTAATATTG CACAATCAAT GGATTCAAGT
GACTTGCTAG AGGACTAA
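
A GC content figure such as the one in the Gene Information fields can be recomputed from a sequence printed in this layout. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases once the block spacing and line breaks are removed. Whether the database computes its value in exactly this way is an assumption, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 30 bases of the gene sequence, copied with their block spacing.
fragment = "ATGAAAATCA GAGGTATCCC TTTGAGGATC"
print(f"{gc_content(fragment):.0%}")  # 40%
```

Passing the full gene sequence in place of `fragment` gives the figure for the whole gene. The 40% shown here applies only to the 30-base fragment.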
 
Protein sequence
MKIRGIPLRI HPSWFLVYLY FTLSSKDQFE TLLNGQATIW NGWVIGAFTS SLLFLSVLLH 
ELAHSFVAIG EGLKVRDITL FFLGGMASLE KECPTSKGSL KIAISGPVVS LLLAFLMILL
SNNLSVSNFI LSNLFKQVGS LNLLIGVFNL LPIIPLDGGV ILKSLIWYFT GSKRAGIKVA
IGSARLISFL AIFIGFLSLV RGNLYLAICF SIIGLFVFSS SKSQSQIIQI QKILSESYVN
QVCSRSFRVL EDDLPVKVLS KYSSFNKDNF FNEVWILLCR EGRWVGYVNE KILKNISVQN
WDKKFLYEFS QPINELPSIS EKESLWKAIL KIEKTKDGRL LVLSFSGLPL GTLDRVDIGK
AVLKKIGLNL PDQLIKIARK DNIYPLGLNL LNIAQSMDSS DLLED