Gene NATL1_12631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_12631 
Symbol 
ID4781130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1084267 
End bp1085217 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content34% 
IMG OID640084542 
Producthydrolases or acyltransferases 
Protein accessionYP_001015086 
Protein GI124025970 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTATTTC CGGAAATAGA ACCTAACGAA AAAGGGATGT TAAATGTCAG TCCACTACAT 
TCCATCTATT GGGAAAGAAG TGGTAATCCC TATGGTTTGT CTGTCTTAAT TATTCATGGC
GGCCCAGGAG GAGGTAGCAG TCCTTCTTAT CGAAGATATT TTGATCCAAA AAAATTTAAT
ATTGTTCAAT TTGATCAAAG AGGATGTGGT AGATCTACTC CACATTCTGA GTTAGAGGAG
AACACCACTC ATCATTTAAT TGAAGATATT GAAAAGATAA GACAGCTCTT GAAAATTGAG
TCATGGCATG TTTTTGGAGG CTCATGGGGG TCAACCTTAA GCTTGATTTA TGCCATACAA
CACACAGAAA AAGTTTTAAG TCTTACTCTT AGGGGGATAT TTTTATGCAG ACAACACGAA
TTAACTTGGT TCTATCAAAA AGGTGCTAGT GAGATTTTCC CTGAAGAATT CGATCTTTAT
CAATCTGTCA TTCCGCTAAA TGAGAGAGGG AATTTGATTA ATGCTTTTCA TAAGAGATTA
ACAAGTCAAG ATAGGTCTGA GAGAACTCAA GCGGCACATG CTTGGACGAG ATGGGAAATG
TCAACTAGCT ATCTCAAGCC AAAAGAATTA TCAATTAATA AAGCTACTAA TGATAATTTC
TCAGACTCTT TTGCGCGTAT AGAATGTCAT TATTTTATTA ATAATATTTT TTTAGAGGAG
AACTATATTC TAAAAAATAT TAATAAGCTA AAAGGTATTC CTGTTTCGAT TGTTCAGGGA
AGATATGACG TTGTTTGTCC AATGAGAAGT GCATGGGATC TTAATAAAGC ATTGCCCACT
TCTAAACTTT ATGTAATCGA TAATGCAGGA CATTCAATGA AAGAAATTGG AATTTCTAAA
AGATTAATTG AATTGACAAA TGAGTTAGCC AATTCTTTCT CTAATCTCTA A
 
Protein sequence
MLFPEIEPNE KGMLNVSPLH SIYWERSGNP YGLSVLIIHG GPGGGSSPSY RRYFDPKKFN 
IVQFDQRGCG RSTPHSELEE NTTHHLIEDI EKIRQLLKIE SWHVFGGSWG STLSLIYAIQ
HTEKVLSLTL RGIFLCRQHE LTWFYQKGAS EIFPEEFDLY QSVIPLNERG NLINAFHKRL
TSQDRSERTQ AAHAWTRWEM STSYLKPKEL SINKATNDNF SDSFARIECH YFINNIFLEE
NYILKNINKL KGIPVSIVQG RYDVVCPMRS AWDLNKALPT SKLYVIDNAG HSMKEIGISK
RLIELTNELA NSFSNL