Gene NATL1_01021 details


Gene Information

Locus tag: NATL1_01021
Symbol:
ID: 4781182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 100202
End bp: 101989
Gene Length: 1788 bp
Protein Length: 595 aa
Translation table: 11
GC content: 34%
IMG OID: 640083365
Product: hypothetical protein
Protein accession: YP_001013931
Protein GI: 124024815
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0652] Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family
TIGRFAM ID:
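
The coordinate and length fields in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `check_lengths` is hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 100202
END_BP = 101989


def check_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS
    ends with exactly one stop codon that is not translated.
    """
    gene_len = end - start + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


gene_len, protein_len = check_lengths(START_BP, END_BP)
print(gene_len, protein_len)  # 1788 595
```

The result matches the reported Gene Length (1788 bp) and Protein Length (595 aa).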


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.273299
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCGC CACTAATACA AGGTGCCTCT GGTATGGAGG GAGAAAAATT AACTTATGCT 
TCTACTAATG AAAATAATAA GGAAATATAT ACTTTTACTG CTAACGAGCC AGTTACTTGG
TCCATAAGTG GCGGAGAGAA ACACCTCTTT TCGATTGATC AAGATACTGG AAAATTAAGT
TTTAAAGATG TTCCTGATTA TGAAACGATC AAGAGCTTAA ATGGAACAAC TGTAGAATTT
CATACTAACT TTTCAACGGC CAGTGTCGGT TCAAAGTTTT TTGTAGAGGT TTATAACGAT
CAAAATCAAA CTAATAAAAC TACACCTATT ACAACAAATA ACTTTATTGA ATATGTAAGT
GACGGTTCAT ACGATAATAC TTTAATTCAT AGATTAGTTT CTGATTTTGT TATTCAGGGT
GGTGGATACA CATGGCCATC TTTAGCATCT AATGAAAGTG GTGGCTATCC ATTAACAGTC
AAATCGAAAG GTGAAATAAT TAATGAACCT ATTAATTCAA ATCTAATGGG TACTATTGCA
ATGGCAAAAG TTTCTGGTCA GCCAAATAGT GCGACATCTG AGTGGTTTAT AAATTTATCC
GATAATATTA ATCTTGATTC TCAAAATGAG GGGTTTAGTG TCTTTGGTCA TCTATTAGGA
GATAGTATTA ATAATCCACT TTTATTAAAT AACCAAACAA AGTATAATGT AAATTTTTCT
GATGTTGGGC TGAATATACC CGAGTTACCT TTAATTAACT TACAGGGAAA TGTTATAAAT
ATTGCGAATT ATTTTGCGAT TCATAAAGTT TCTACAATTA GCCAACGTCC TAGTGAAATT
GAGAATGTAT TTAATGTAAT CGTGACTGCT AATGATTCAC TTGGAAATCA ATCAAATCAA
TATGTAGTCG TTAATGTTAA AGATATCCAA GGAGAGGTTC TTGATGGCAT AGATGGACCA
GATGTTCTCA AGGGAGGCTT GGGAAATGAT ACTTTTAAAG GGAATGGTGG AAACGATACG
ATTGATGGAG GCAGTGATTT TGATATAGCT ACTTACTCAG GTAATTTTTC TGATTACACT
TTTACCATCG CTAATAAAGT TGTTACCATT AGCGATAACC GTTTATCGGA AAATGATGGA
ATAGATACAT TGTCTAATAT CGAGAAACTT ACTTTTGTTG ATAAAAATGC TTTAATCACC
AGTAAAGAAA TTAAAGCAAT TGATGTCTTA GGATTTCAAG CAGAAAAAGT TTATTCAGGC
AAAAGTGATT CTTATAAATT TTATGATTTA GGAGGTAATA ACTATGGTGT TGGGACTTCT
ACTGGTATTG ATCAGTTGAC TGGTGAATCT ATTCTCAAAT TTGATGATAA AAACATGAAT
TTAAAGCATG ACATCAAAGC AACATTTGAT CAAGTAACGG GTTTAGATAC AGATTCTGGA
AAAATGTTCC GACTATACAA CGCCTCATTT AAACGTCTAC CTGATCCAGA TGGATTACGA
TATTGGATCA GTAATTTTAG TTCTGGTAAA GATGATGAAA GAGCAGTGGC TTCATCATTT
TTAGCCTCTG CAGAATTCAA GGAGCGTTAT GGGGAAGACG TCTCCAATGA AAGCTATGTG
AACACTCTTT ATATCAATGT TTTAGGTAGA GATTACGACC AGGCTGGTTA TAATTACTGG
TTAGGTAATC TGAATAATGG TGTTGAGACC AAGTATGAAT TGCTATTGGG GTTTTCTGAA
TCAGTGGAAA ACAAAGGACT TTTTTCTGAG ATGACTGGTT TCTATTAA
 
Protein sequence
MTAPLIQGAS GMEGEKLTYA STNENNKEIY TFTANEPVTW SISGGEKHLF SIDQDTGKLS 
FKDVPDYETI KSLNGTTVEF HTNFSTASVG SKFFVEVYND QNQTNKTTPI TTNNFIEYVS
DGSYDNTLIH RLVSDFVIQG GGYTWPSLAS NESGGYPLTV KSKGEIINEP INSNLMGTIA
MAKVSGQPNS ATSEWFINLS DNINLDSQNE GFSVFGHLLG DSINNPLLLN NQTKYNVNFS
DVGLNIPELP LINLQGNVIN IANYFAIHKV STISQRPSEI ENVFNVIVTA NDSLGNQSNQ
YVVVNVKDIQ GEVLDGIDGP DVLKGGLGND TFKGNGGNDT IDGGSDFDIA TYSGNFSDYT
FTIANKVVTI SDNRLSENDG IDTLSNIEKL TFVDKNALIT SKEIKAIDVL GFQAEKVYSG
KSDSYKFYDL GGNNYGVGTS TGIDQLTGES ILKFDDKNMN LKHDIKATFD QVTGLDTDSG
KMFRLYNASF KRLPDPDGLR YWISNFSSGK DDERAVASSF LASAEFKERY GEDVSNESYV
NTLYINVLGR DYDQAGYNYW LGNLNNGVET KYELLLGFSE SVENKGLFSE MTGFY
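
The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that step, assuming the sequence text has been copied into a string; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record (sample input only).
first_line = (
    "ATGACCGCGC CACTAATACA AGGTGCCTCT GGTATGGAGG GAGAAAAATT AACTTATGCT"
)

seq = clean_sequence(first_line)
print(len(seq))                         # 60 (six blocks of 10 bases)
print(f"{gc_fraction(seq):.2%}")        # GC content of this fragment only
```

Applied to the full gene sequence, the same two helpers could be used to compare the cleaned length with the reported Gene Length and the GC fraction with the reported GC content.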