Gene NATL1_15171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15171 
Symbol 
ID4779092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1230968 
End bp1232947 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content34% 
IMG OID640084799 
ProductHD superfamily hydrolase 
Protein accessionYP_001015339 
Protein GI124026223 
COG category[R] General function prediction only 
COG ID[COG1480] Predicted membrane-associated HD superfamily hydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.450871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCTGTC TATTAGTAGC AATATTTTCA AGTTATAAGT TACTTGCTGT TCCGGATCTC 
AAACCTGGAG ATATTGCTCA AGTCAATGTA ATAGCCCCTA GAGATGCAAA GGTAATAGAC
ACAACGGATT TAAAAGAAAA GAAACAAGGC TTAAAAGAAA GTTTTGTACA ATCAATAGAC
AAGAATAAAT CAAGTGATTT AGAAAAGACT GTTTCCAAGC AAATCAATAT ACTTCGTACT
CAAAAATTCA ACAATTTTGG AATCGATTTC AATGAATTCA ACATAACAAC TCTAGAGAAA
GATTGGATAT TAAATGTTAA AGACAATGAA TGGGAAGAGT GGAAAAAAGA GATCAAAAAT
GTTTCAAAAA AAATGCTTTC TCAGGGAATT ATTAATACAC TTGCACTTGA TCAACTTAAT
GAAGCTTCTT CACTTCAATT AATAGATTTA GGTGAGAAAG ATTCTCCAAA TAGATCATTA
GGTGCAAAAA TATTATCAAA TAGTTTTCAT CAAAAAAGTA ATTTAAAAAT TGATAAACTA
AAAACTAATA TATTACTCGA AAATCTAATC AATCAAGAAG GTATAAACAC AATAAATGTT
AAAGAGGGCA GTATAATATC AAGGAAAGGT AAGCCAATAA CTTCACAAGA GTTTGATATC
CTAGAACATT TTAATAAAGT AAGTCGAAGT CCTAGGCCCT TAAAGTGGTT AATTACCTTC
TCAGAATCCA TGGGAAGTTG TGGATTACTT CTTATGATCA TGAGAAGAGA AAAGCCTAGA
CTTCAGGCCA GGCACGGATT ACTATCCTTA ACTTTATTAC TTGTAGTTCA ACTAACAAAA
GATTGGCTTG GGCCTATAGC AAGTCCCATG CAATTAATAT TACCTCCCAC ATTGCTTCTT
TCTCAAGGGA TAGGCACCAT AACATCATTG GCTTGGATGG CAGCTGCTAG TCTGATTTGG
CCATCATCTC TTGGTGAATC AATTGAAGTC AGACTAATAA TTGCTTGTAT CGCTGGTTCA
TTTATTGCAT TTTTAGGCAG AAGGATGAGA AGCCGAGCAC AGGTTCTTCA AATAGCTGTA
TTTATTCCTT TTGGTGCATT ATTAGGGCAA TGGTTTATTC TTAATCAGGT AATTAAAGAA
AATAATATAG AATTTAACAA TCTATCTATT GACCCTAATT CTCTTTTTAA TGAGACCATT
ATTATTAGTT CAATATTAAT GGTAACAATA TTAATTATTC CAATTCTAGA AAATACATTT
GGATTACTTA CTAGAGCAAG ATTAATGGAA CTTGCTGATC AAGAACGTCC TTTACTTCGT
AGATTATCTA GAGAAGCTCC AGGGACGTTT GAACATACTT TAACAATTTT AAGCCTCGCC
GAGGAAGGAG CAAGAGTTAT TGGAGCTGAT GTTGACTTAA TAAGAACTGG GGCTTTATAC
CATGATGTAG GGAAATTGCA TGCTCCAAAT TGGTTTATTG AAAATCAGAA AGATGGAATA
AACCCACACG ATGAAATAAA AAACCCTTAT AAAAGTGCCG ATATTCTTCA AGCCCATGTC
GATGAAGGAT TGAAGCTTGC GAGGAAATAT CGACTTCCAT CTCCTATTGC TGATTTCATC
CCAGAGCATC AAGGTACTTT AAAGATGGGA TATTTTCTTC ATAAAGCTAG AGAAAGTGAT
CCTTCAGCCT CTGAAAAACG TTTTAGATAT AAAGGACCTA TTCCTCATTC AAAAGAAACT
GGAATACTCA TGCTTGCAGA TGGCTGCGAA GCAGCATTAA GAGCTCTTGA CTCTTCTTCT
TCAGATAAAG ATGCATGCAA AACAGTTAGA AAGATCATCC AATCTCGTCA GGTTGACGGT
CAATTAAAAG AAAGTAGTTT AACCAGAGCA GAAATAGAAA TAATTCTTAG AGCCTTTGTC
TCTGTATGGA GGAGAATGCG TCACAGACGC TTAAAATACC CAAGCTTCAA TCCAAGGTGA
 
Protein sequence
MVCLLVAIFS SYKLLAVPDL KPGDIAQVNV IAPRDAKVID TTDLKEKKQG LKESFVQSID 
KNKSSDLEKT VSKQINILRT QKFNNFGIDF NEFNITTLEK DWILNVKDNE WEEWKKEIKN
VSKKMLSQGI INTLALDQLN EASSLQLIDL GEKDSPNRSL GAKILSNSFH QKSNLKIDKL
KTNILLENLI NQEGINTINV KEGSIISRKG KPITSQEFDI LEHFNKVSRS PRPLKWLITF
SESMGSCGLL LMIMRREKPR LQARHGLLSL TLLLVVQLTK DWLGPIASPM QLILPPTLLL
SQGIGTITSL AWMAAASLIW PSSLGESIEV RLIIACIAGS FIAFLGRRMR SRAQVLQIAV
FIPFGALLGQ WFILNQVIKE NNIEFNNLSI DPNSLFNETI IISSILMVTI LIIPILENTF
GLLTRARLME LADQERPLLR RLSREAPGTF EHTLTILSLA EEGARVIGAD VDLIRTGALY
HDVGKLHAPN WFIENQKDGI NPHDEIKNPY KSADILQAHV DEGLKLARKY RLPSPIADFI
PEHQGTLKMG YFLHKARESD PSASEKRFRY KGPIPHSKET GILMLADGCE AALRALDSSS
SDKDACKTVR KIIQSRQVDG QLKESSLTRA EIEIILRAFV SVWRRMRHRR LKYPSFNPR