Gene NATL1_15401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15401 
Symbol 
ID4779652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1253877 
End bp1255472 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content33% 
IMG OID640084822 
Productputative dienelactone hydrolase 
Protein accessionYP_001015362 
Protein GI124026246 
COG category[R] General function prediction only 
COG ID[COG4188] Predicted dienelactone hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGAACAA ATTCATTCAT TAAAAAATTA GTTGTTCTTG GAACAGTTGG AAGCTTACTA 
GCTCCTTATT GTTTTTATCC AAAACTTCAA GCAGCAGAGA GATTTGAAAT TCATTTCGAT
GGAATGTCCA TCCCAATTTC GATAAAAGAA TTGATTGATT GGAGTAATGG TGAAGAGGAA
AAAAATTCTG AATTAGCTAG TTGGCTTAAT TTACTTGGCT TTAAGGAAAG GAAAGGTTTG
GCAAAGTTTT TAAGTACACC GTTGGTAAGA GATAAAAGTA TGGCAAGACA AATTTTGAGA
AGTTGGTCAG GACGAAAATT GCTTGATGAA GTAAGTGATC TAATACTTAT GGATGAAGAC
AGCTCAGGCG AAAGTGTTCT CGATACTCTA GAAAAACTTT TAAATGAAAA AGATGAGGTG
ACAACTTTTG ATCTTTTAAA TGCTCTCTCT GTTAAAGCAA TTCACATTGA TTTGGATGGG
TGGATTGAAG TAGCTAATAA TTGGAGAAGT GAGTTAAACA AACAACAGAA ACTCATAACT
GATTTAGTGT CAATTAATGA TTTATCAGTT ACCAGAGAGA CAATGAATGT TTTACCTCTT
GAAATAAAAG AAACCGAATA TGAATTAATT TCCTTAACTG TTTCTCATCG AAAAGAGCCT
TTAATTTTAG AGGTTTGGAA CCCATCTTTT AGGAAAAAAA ATAGAAAAAA TTGGGTTCTT
TTAATGCCTG GTTTAGGAGG AGATCGTAAT CATTTTAATT GGCTTGCAAG AAGTCTTAGT
CACAATGGTT GGCCTGTTGT TGTCTTGGAT CATCCAGGTA GTGACTCATT AGCATTGGAA
GCCTTGGTAA AAGGAAGACT ACCTTTACCA GGTGCTGAAA TAATTCCTGA ACGTTTGAAC
GATATCCATA GCATCCTCAA GGCAAAAAAA TCAGGAACAA TTGATTTATT GGCAGAGAAT
GTTGTCTTGA TGGGGCATTC GTTAGGAGCC CTCACAGCTA TTTTGGCTTC AGGAGTAAAA
ATAGATGATC AACTTGAAAA TAGATGTCAG GAGGTACTTG ATAATCTTTC TCTTTCTAAT
TTATCTTCAC TTTTACAATG TCAACTAATA GATATTACTT TGTCAGATAC TAATGGTATA
GAAAATCTTT CAGCTATTGT TGGTATGAAT AGTTTTGGGA GTTTTTTGTG GCCAAATAAT
TTAGAAAAAA AAATAAATAT TCCTCTTTTC CTTACAGGAG GAACTTTTGA TTTAGTTACT
CCTTCTATTA GTGAACAACT AGGATTAATG CTTGCTTTGA GTTCAAGCCC ATTAAGTAGA
GTCCTTTTAA TTGAGAGAGC TAGCCATTTT TCACCTATTA GAGTAGAGGG ACAAATGAAT
CAGTCTAAAG GTAAGGATTT ATTTAATCTA GGAGAATCAA TAGTTGGATA TCATCCACTT
TCTGTTCAGA GCTTATTAGC TTTTGAGATC ATCAACTTCC TAGAAAAATT AGAAGAGAAT
AAAACAGTCC CTTTGAATAC GAATTTAACT AAAGGCGAGC TTAAGTTTCA TATCTTAGAC
AGTAATATAA TTGAACAACT TATCAATATT CAATAA
 
Protein sequence
MRTNSFIKKL VVLGTVGSLL APYCFYPKLQ AAERFEIHFD GMSIPISIKE LIDWSNGEEE 
KNSELASWLN LLGFKERKGL AKFLSTPLVR DKSMARQILR SWSGRKLLDE VSDLILMDED
SSGESVLDTL EKLLNEKDEV TTFDLLNALS VKAIHIDLDG WIEVANNWRS ELNKQQKLIT
DLVSINDLSV TRETMNVLPL EIKETEYELI SLTVSHRKEP LILEVWNPSF RKKNRKNWVL
LMPGLGGDRN HFNWLARSLS HNGWPVVVLD HPGSDSLALE ALVKGRLPLP GAEIIPERLN
DIHSILKAKK SGTIDLLAEN VVLMGHSLGA LTAILASGVK IDDQLENRCQ EVLDNLSLSN
LSSLLQCQLI DITLSDTNGI ENLSAIVGMN SFGSFLWPNN LEKKINIPLF LTGGTFDLVT
PSISEQLGLM LALSSSPLSR VLLIERASHF SPIRVEGQMN QSKGKDLFNL GESIVGYHPL
SVQSLLAFEI INFLEKLEEN KTVPLNTNLT KGELKFHILD SNIIEQLINI Q