Gene NATL1_20651 details


Gene Information

Locus tag: NATL1_20651
Symbol:
ID: 4779320
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1709997
End bp: 1711181
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 39%
IMG OID: 640085361
Product: zinc metallopeptidase
Protein accession: YP_001015885
Protein GI: 124026770
COG category: [R] General function prediction only
COG ID: [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase
TIGRFAM ID: [TIGR01891] amidohydrolase
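
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1709997
END_BP = 1711181
GENE_LENGTH_BP = 1185
PROTEIN_LENGTH_AA = 394


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1185
print(expected_protein_length(GENE_LENGTH_BP))  # 394
```

Both results agree with the listed Gene Length and Protein Length, which is consistent with the stated assumptions.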


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGATT TAGGAAAAAA AATTGACGTT TTAACCAAGG ATATACTGCC TGATTTAATT 
CAATTACGTC GTCATCTCCA TGCTCATCCA GAGCTAAGTG GTCAGGAATA TCAAACTGCT
GCTCTTGTTG CCGGGGAGCT TAGAAAATCA GGTTGGGAAG TAAAAGAAGC AGTTGGCAAA
ACCGGAGTAG TAGCTGAAAT AGGTAAGAAA AGCGGACCTG TTGTTGGCTT ACGAGTTGAT
ATGGATGCTT TGCCAATTGA GGAGAGAACA GGTTTAGAAT ATTCTTCTTC AATTCAAGGC
CTGATGCATG CATGTGGCCA TGATCTCCAT ACTTGTATTG GTTTGGGGGT GGCTAAAGTA
TTAGCAAAAA ACAAATTTAC AAATTCTCGA ATCCGAATCA TTTTTCAACC TGCTGAAGAG
ATTGCTCAAG GTGCAAATTG GATGAGAGCT GAAAAGGTTC TTGAAGGTGT TCAAGCTCTC
TTTGGTGTGC ATGTTTATCC AGATTTGTCA GTTGGCAAGA TTGGAATAAA AACTGGAACT
TTTACAGCTG CCGCTGCTGA ATTGGAAATA GAGATTATTG GTGATGGAGG GCATGGAGCT
AGACCACATG AAGGCATAGA TTCAATTTGG ATTTCTGCAA AAGTTATTAG TGGACTTCAA
GAGGCTATTA GTAGACGTTT AGATGCGCTT AAGCCTGTAG TTATTAGCTT TGGGAAGATT
TCAGGAGGTA ATGCTTTCAA TGTAATTGCT GAGAGGGTTA AGCTTCTAGG TACAGTAAGG
TGTCTTGATA ACAACCTTTA TGAAAAATTG CCTCAATGGA TTGAGAAAAT AGTACAAAAT
ATAGCCTCTA CTCACGGAGG TAAGGCGAAC ATAAAATTTA AGTCGATCGC GCCCCCAGTT
TATAACGATC CAGAGTTGAC TAGTTTGTTA TCTACCTGTG CGAAGAATTT TATGGATGAA
GAAAATATTG TTTTTTTAGA AAATCCGTCA TTAGGAGCTG AAGATTTTGC TTTCTTCTTG
CAAGATGTTC CAGGCACGAT GTTTAGATTA GGAGTGGCTG GCAATCAAGG TTGTGCTCCA
TTGCACAGTG GAAACTTTTC TTTGGATGAA AGAAGCCTAG AATTAGGAAT AAAAATTTTG
TCTCAAACGT TAGTCATGGC ATCTAAAACC CTTCAAGACA TTTAG
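
The GC content field can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch; `gc_fraction` is an illustrative helper, not part of the database, and it assumes the input contains only base letters and whitespace. The toy input is for demonstration only; pasting the full gene sequence in its place would give the value for this gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Toy example: 4 of the 8 bases are G or C.
print(f"{gc_fraction('ATGC ATGC'):.0%}")  # 50%
```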
 
Protein sequence
MKDLGKKIDV LTKDILPDLI QLRRHLHAHP ELSGQEYQTA ALVAGELRKS GWEVKEAVGK 
TGVVAEIGKK SGPVVGLRVD MDALPIEERT GLEYSSSIQG LMHACGHDLH TCIGLGVAKV
LAKNKFTNSR IRIIFQPAEE IAQGANWMRA EKVLEGVQAL FGVHVYPDLS VGKIGIKTGT
FTAAAAELEI EIIGDGGHGA RPHEGIDSIW ISAKVISGLQ EAISRRLDAL KPVVISFGKI
SGGNAFNVIA ERVKLLGTVR CLDNNLYEKL PQWIEKIVQN IASTHGGKAN IKFKSIAPPV
YNDPELTSLL STCAKNFMDE ENIVFLENPS LGAEDFAFFL QDVPGTMFRL GVAGNQGCAP
LHSGNFSLDE RSLELGIKIL SQTLVMASKT LQDI
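
The correspondence between the gene and protein sequences can be spot-checked by translating a few codons. The sketch below is hedged: it builds the standard codon table by hand (translation table 11 uses the same codon-to-amino-acid assignments and differs mainly in which start codons are permitted), and the `translate` helper is illustrative rather than the database's own method. It does not handle alternative start codons or ambiguous bases such as N.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGAAAGATT TAGGAAAAAA AATTGACGTT"))  # MKDLGKKIDV
print(translate("CTTCAAGACA TTTAG"))                  # LQDI
```

The two outputs match the first ten and last four residues of the protein sequence listed above.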