Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_18211 |
Symbol | |
ID | 4780068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1485867 |
End bp | 1487678 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640085110 |
Product | hypothetical protein |
Protein accession | YP_001015641 |
Protein GI | 124026526 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.905605 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGGAC CTCATATAGA AGAGAACGGA AAGAAAAAAG TAACTGAAGG AAAAACATTC CCAGTTCCAT TTAATTTAGA AGAGCTTAAT AAAAAAAGAT CTATTACCAC GAAGACTTCT TATAAACCTT CTAAAGAACA AATCATTAAT CAAGCATTTA AATTTCATTC ACAAGGAAAC ATTTCAGAAG CAGCAAAATA TTATCAATAT TGCATAAATC AAGCATTCAA GGATTACAGA GTCTTTACTA ATTATGGAGT CATTTTAAAA AAATTTGGAA AATTGAAAGA AGCAGAAAAA TGTCAACGCG AAGCAATTCA AATTAATCCT AATTTCGCAG AAGCATATTC CAATCTAGGT AATATATTGA GTGATCTTGG TCAATTAAAA GAAGCAGAGT TATCCTTTCG CAAAGCCATT GAAATCAAAT CTGATTACGC AGAAGCACAT TCAAATCTAG GTAATATATT GAGAGATTTT GGTCAATTAA AAGAAGCAGA GTTATCCTTT CGCAAAGCTA TTGAAATCAA ATCTGATTAC GCAGAAGCAC ATTCAAATCT AGGTAATATA TTGAATGATC TTGGTCAATT AAAAGAAGCA GAGTTATCCT TTCGCAAAGC TATTGAAATT AAACCTGATT TCGCAAATAC TCATAATAAT CTAGGAATCA TATTGAGTGA TCTTGATCAA TTAAAAGAAG CAGAGTTATC CTTTCGCAAA GCTATTGAAA TCAAACCTGA TTTCATAAAA GCATACTCAA ATCTTGGGAA CATATTGAGA GATCTTGGTC AATTAAAAGA AGCAGAGTTA TCCTTTCGCA AAGCTATTAA AATCAAACCT GATTACGCAG AAGCTTATTT CAACCTTGCT TACTTAGAGC TGCTAAAAGG CAATTATAAA TCTGGACTAA AGAACTATGA GTTTAGGTTC AAAAAAAAGC AACCTACTGT TCCTCACATC AAAACAAAAT TGAAAAGAGT CACTAATGAA CAATTCCAAA AAGGAGAGAA ACTTTTAGTT GTTAGTGAGC AAGGTTTAGG AGATACTCTA CAATATATGC GATACATACC ATATCTGCAA AATCAAGGTT TTGATATTAA TTTTTGTGCC CAGACACAAC TCCATTCATT GATCAAAGCA TCAGCTATAA GTGCAGATCC ATTAATAACA GAACAAGCAA ATCAACTTTC ACTAGGACAA TGGATACCAT TATTATCCTT ACCTAAATAT CTACAAGTAA GCCCAAATAA TCCAATCATC TCAGAACCAT ATATTAACTC CACAGATAAT CTAACAAAGA AGTGGAAAAA GATACTCTCT AAAGAAAAAA GAGCAGTCAT TGGTATTAAT TGGCAAGGTA ATCCAACAAT GGAAAAAAGT TTTTATCAAG GGCGGTCAAT ACCTCTCGAA ATATTTTCAA TCCTTCCTGA ACATAATGAA ATCACAATGC TCTCATTACA AAAAGGATTT GGATCGGAAC AATTAGAGAA TTGCTCTTTC AAAAATAAAT TTGTTAGCTG CCAATCAGAA ATTGATTCCA CTTGGGATTT TTTAGAAAAT GCTGCCATTA TTGATAACTG TGATTTAATT ATTACTTGCG ATACTTCGAT TGCTCATTTA GCTGGAGGGA TGGGAAAGAA AGTTTGGTTA CTTTTAAGAG ATATCCCTTA TTGGACTTGG GGACTTGACG GGGAAACTAC ATTTTGGTAT CCATCGATGA GATTATTTCG ACAAAATAAA CGTCACAATT GGCAAGAGGT TATGGAGAGA GTATCAAGCG CACTTGAAAA AGAAATTGAG GCAAAAGTAT GA
|
Protein sequence | MDGPHIEENG KKKVTEGKTF PVPFNLEELN KKRSITTKTS YKPSKEQIIN QAFKFHSQGN ISEAAKYYQY CINQAFKDYR VFTNYGVILK KFGKLKEAEK CQREAIQINP NFAEAYSNLG NILSDLGQLK EAELSFRKAI EIKSDYAEAH SNLGNILRDF GQLKEAELSF RKAIEIKSDY AEAHSNLGNI LNDLGQLKEA ELSFRKAIEI KPDFANTHNN LGIILSDLDQ LKEAELSFRK AIEIKPDFIK AYSNLGNILR DLGQLKEAEL SFRKAIKIKP DYAEAYFNLA YLELLKGNYK SGLKNYEFRF KKKQPTVPHI KTKLKRVTNE QFQKGEKLLV VSEQGLGDTL QYMRYIPYLQ NQGFDINFCA QTQLHSLIKA SAISADPLIT EQANQLSLGQ WIPLLSLPKY LQVSPNNPII SEPYINSTDN LTKKWKKILS KEKRAVIGIN WQGNPTMEKS FYQGRSIPLE IFSILPEHNE ITMLSLQKGF GSEQLENCSF KNKFVSCQSE IDSTWDFLEN AAIIDNCDLI ITCDTSIAHL AGGMGKKVWL LLRDIPYWTW GLDGETTFWY PSMRLFRQNK RHNWQEVMER VSSALEKEIE AKV
|
| |