Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_01021 |
Symbol | |
ID | 4781182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 100202 |
End bp | 101989 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640083365 |
Product | hypothetical protein |
Protein accession | YP_001013931 |
Protein GI | 124024815 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0652] Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.273299 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCGC CACTAATACA AGGTGCCTCT GGTATGGAGG GAGAAAAATT AACTTATGCT TCTACTAATG AAAATAATAA GGAAATATAT ACTTTTACTG CTAACGAGCC AGTTACTTGG TCCATAAGTG GCGGAGAGAA ACACCTCTTT TCGATTGATC AAGATACTGG AAAATTAAGT TTTAAAGATG TTCCTGATTA TGAAACGATC AAGAGCTTAA ATGGAACAAC TGTAGAATTT CATACTAACT TTTCAACGGC CAGTGTCGGT TCAAAGTTTT TTGTAGAGGT TTATAACGAT CAAAATCAAA CTAATAAAAC TACACCTATT ACAACAAATA ACTTTATTGA ATATGTAAGT GACGGTTCAT ACGATAATAC TTTAATTCAT AGATTAGTTT CTGATTTTGT TATTCAGGGT GGTGGATACA CATGGCCATC TTTAGCATCT AATGAAAGTG GTGGCTATCC ATTAACAGTC AAATCGAAAG GTGAAATAAT TAATGAACCT ATTAATTCAA ATCTAATGGG TACTATTGCA ATGGCAAAAG TTTCTGGTCA GCCAAATAGT GCGACATCTG AGTGGTTTAT AAATTTATCC GATAATATTA ATCTTGATTC TCAAAATGAG GGGTTTAGTG TCTTTGGTCA TCTATTAGGA GATAGTATTA ATAATCCACT TTTATTAAAT AACCAAACAA AGTATAATGT AAATTTTTCT GATGTTGGGC TGAATATACC CGAGTTACCT TTAATTAACT TACAGGGAAA TGTTATAAAT ATTGCGAATT ATTTTGCGAT TCATAAAGTT TCTACAATTA GCCAACGTCC TAGTGAAATT GAGAATGTAT TTAATGTAAT CGTGACTGCT AATGATTCAC TTGGAAATCA ATCAAATCAA TATGTAGTCG TTAATGTTAA AGATATCCAA GGAGAGGTTC TTGATGGCAT AGATGGACCA GATGTTCTCA AGGGAGGCTT GGGAAATGAT ACTTTTAAAG GGAATGGTGG AAACGATACG ATTGATGGAG GCAGTGATTT TGATATAGCT ACTTACTCAG GTAATTTTTC TGATTACACT TTTACCATCG CTAATAAAGT TGTTACCATT AGCGATAACC GTTTATCGGA AAATGATGGA ATAGATACAT TGTCTAATAT CGAGAAACTT ACTTTTGTTG ATAAAAATGC TTTAATCACC AGTAAAGAAA TTAAAGCAAT TGATGTCTTA GGATTTCAAG CAGAAAAAGT TTATTCAGGC AAAAGTGATT CTTATAAATT TTATGATTTA GGAGGTAATA ACTATGGTGT TGGGACTTCT ACTGGTATTG ATCAGTTGAC TGGTGAATCT ATTCTCAAAT TTGATGATAA AAACATGAAT TTAAAGCATG ACATCAAAGC AACATTTGAT CAAGTAACGG GTTTAGATAC AGATTCTGGA AAAATGTTCC GACTATACAA CGCCTCATTT AAACGTCTAC CTGATCCAGA TGGATTACGA TATTGGATCA GTAATTTTAG TTCTGGTAAA GATGATGAAA GAGCAGTGGC TTCATCATTT TTAGCCTCTG CAGAATTCAA GGAGCGTTAT GGGGAAGACG TCTCCAATGA AAGCTATGTG AACACTCTTT ATATCAATGT TTTAGGTAGA GATTACGACC AGGCTGGTTA TAATTACTGG TTAGGTAATC TGAATAATGG TGTTGAGACC AAGTATGAAT TGCTATTGGG GTTTTCTGAA TCAGTGGAAA ACAAAGGACT TTTTTCTGAG ATGACTGGTT TCTATTAA
|
Protein sequence | MTAPLIQGAS GMEGEKLTYA STNENNKEIY TFTANEPVTW SISGGEKHLF SIDQDTGKLS FKDVPDYETI KSLNGTTVEF HTNFSTASVG SKFFVEVYND QNQTNKTTPI TTNNFIEYVS DGSYDNTLIH RLVSDFVIQG GGYTWPSLAS NESGGYPLTV KSKGEIINEP INSNLMGTIA MAKVSGQPNS ATSEWFINLS DNINLDSQNE GFSVFGHLLG DSINNPLLLN NQTKYNVNFS DVGLNIPELP LINLQGNVIN IANYFAIHKV STISQRPSEI ENVFNVIVTA NDSLGNQSNQ YVVVNVKDIQ GEVLDGIDGP DVLKGGLGND TFKGNGGNDT IDGGSDFDIA TYSGNFSDYT FTIANKVVTI SDNRLSENDG IDTLSNIEKL TFVDKNALIT SKEIKAIDVL GFQAEKVYSG KSDSYKFYDL GGNNYGVGTS TGIDQLTGES ILKFDDKNMN LKHDIKATFD QVTGLDTDSG KMFRLYNASF KRLPDPDGLR YWISNFSSGK DDERAVASSF LASAEFKERY GEDVSNESYV NTLYINVLGR DYDQAGYNYW LGNLNNGVET KYELLLGFSE SVENKGLFSE MTGFY
|
| |