Gene Information |
Locus tag | NATL1_18201 |
Symbol | |
ID | 4780069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1484472 |
End bp | 1485764 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 640085109 |
Product | hypothetical protein |
Protein accession | YP_001015640 |
Protein GI | 124026525 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | [TIGR01444] methyltransferase, FkbM family |
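The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1484472
END_BP = 1485764
GENE_LENGTH_BP = 1293
PROTEIN_LENGTH_AA = 430


def span_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, ASSUMING one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1293
print(expected_protein_length(GENE_LENGTH_BP))  # 430
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.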
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.48017 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATCTA GTGATGAAGA TCAAGGGGAA AAGAATAAAC CTCAAGTCCA AATATTTACT GTTCCATTTA CTTTAGGTGA AATCAAAGAA AATATTAATA TTAATATTAA TACTCCTTCT AAACCTTCTA AAGAGCAAAT CATTAATCAA GCATTTAAAT TTCATTCACA AGGAAACATT TCAGAAGCAG CAAAATATTA TCAATATTGC ATAAATCAAG CATTCAAAGA TTACAGAGTC TTTACTAATT ATGGAGTCAT TTTAAAAAAA TTTGGAAAAT TGCAAGAAGC AGAAAAATTT CAACGCGAAG CAATTCAAAT TAATCCTAAT TTCGCAGAAG CATATTCCAA TCTAGGTAAT ATATTGAGAG ATCTTGGCCA ATTAAAAGAA GCAGAGTTAT CCTTTCGCAA AGCTATTGAA ATCAAATCTG ATTACGCAGA AGCATATTCC AATCTAGGTA ATATATTGAG AGATCTTGGT CAATTAAAAG AAGCAGAGTT ATCCTTTCGC AAAGCTATTG AAATCAAACC TGATTACGCA GAAGCACATT CCAATCTAGG TAATATATTG AGTGATCTTG GCATCAAAAA AGAAGCGAAA TTAGAAAAGC AGAAAAGCAT TGACATCAAT TTTATTCAAA ATATTAACTC TAAATATAGA GAAGATTGTA AAAGGAAAAT CAGTCTATCA AAATCACAAT TCAGACAAGA TTTATTTGCC CTGACTGAGT TGGGTTTTAA AAAAAATGGT TTTTTTGTAG AATTTGGTTC TTATGATGGC CTGACTGGTT CTAATTCTTA TCTATTAGAG AAATTTTTTA ATTGGGATGG GATATTAGCT GAACCAATGA TTTCCTGTCA TGAAAAGTTA AAAAATAATC GTTCAGTACG TATAGAAACC AAATGTGTAT GGAAATCATC TGGTCAAGAG ATCTTATTTA ACGAACCTTT TTCATATAAG CAAATGGCTA CAATAGATAG TTTTTCGGAG GAGAAACCAG ATTATTATAA GGAAGGAAAA AGATATATTG TTAAATCAAT TTCATTATTA GATTTATTAG ATATAAATAA TGCACCAAAG TTTATAGATT ATTTATCTAT AGATACAGAA GGAAGTGAAT ATGATATTTT AAATGCTTTT GATTTTGAGA AATATAAATT TAGGGTAATT ACCTGTGAGC ATAATTTTAC TCCTATGAGA GAAAAAATTT ATAACCTTTT AATTAATAAC GGCTATAAAA GAAAATTAAC AAATTTATCA AGAGTTGATG ATTGGTATAT TTATGAATCC TAA
|
Protein sequence | MESSDEDQGE KNKPQVQIFT VPFTLGEIKE NINININTPS KPSKEQIINQ AFKFHSQGNI SEAAKYYQYC INQAFKDYRV FTNYGVILKK FGKLQEAEKF QREAIQINPN FAEAYSNLGN ILRDLGQLKE AELSFRKAIE IKSDYAEAYS NLGNILRDLG QLKEAELSFR KAIEIKPDYA EAHSNLGNIL SDLGIKKEAK LEKQKSIDIN FIQNINSKYR EDCKRKISLS KSQFRQDLFA LTELGFKKNG FFVEFGSYDG LTGSNSYLLE KFFNWDGILA EPMISCHEKL KNNRSVRIET KCVWKSSGQE ILFNEPFSYK QMATIDSFSE EKPDYYKEGK RYIVKSISLL DLLDINNAPK FIDYLSIDTE GSEYDILNAF DFEKYKFRVI TCEHNFTPMR EKIYNLLINN GYKRKLTNLS RVDDWYIYES
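The gene and protein sequences are shown in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how that might be done, together with a GC fraction and a translation using the standard codon assignments (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene are used to keep the example short.

```python
# Codon table built from the standard TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing (and any other whitespace) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence and first block of the protein
# sequence, copied from the record.
GENE_PREFIX = "ATGGAATCTA GTGATGAAGA TCAAGGGGAA"
PROTEIN_PREFIX = "MESSDEDQGE"

print(translate(GENE_PREFIX))  # MESSDEDQGE
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.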