Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_00421 |
Symbol | |
ID | 4779864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 43375 |
End bp | 44829 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 640083305 |
Product | hypothetical protein |
Protein accession | YP_001013871 |
Protein GI | 124024755 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.391795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGGTT TTGGAGAACA GCATAAATCT AAAAAGAAAA GAAATGAGAC AACTACATTC CAAGTTCCAT TTCCATTTCC TTTAAGAGAA AGTCAAGAAA ATATTAATAT TATTACTAAT ACTCCTTCGA AACTTTCTAA AGAACAAATA ATTAATCAAG CATTAAAGTT TCATTCAAAA GGAAACATTT CAGAAGCAAC AAAATATTAT CAATATTTTA TCAACCAAGG TTTTAAAGAT CATAGAGTTT TTTCTAATTA TGGAGCAATC CTAAGAAAAC AGGGGAAAGT AAAAGAATCA GGGTTTTTCA TGAGAAAAGC AATTGAAATT CAACCTAATA TCCCCATAAT AAATTTTAAC ATGGGAAATA TATTAAAAGA TCTAGGTAAA TTAAAAGAAG CGGAAATATT TATTCGTAAA TCAATTAGAA TGGATCCTAA TCTTACAAAA GCATATCTTT CTTTATCAAC TATTACATAC TCAAATATCG ATAATAAAAC TTGGCAGGAT AATCTTTTTT CTGATTTTAT ATTAAAGTAT AAATCACAAG TAGAGCAAGT TGAAATCTAC TTCGCTAGAG CAAATATTCT TCATAAAGAA AAAAAATATA AGAAAAGTTA TCAAAACTTA AAATTGGCAA ATAACTTACA ACTTGCAATT AAGGCAAGTG ATATTAATAA AAGAATAAAA AAGTCTCAAT TATTACTTAT TGAATCTGAT AAGAAGGTAG TTAATAAAAA AGAAAAGATA AATTATGCCC AGAGTATTTT TATTTTAGGG TTGCCAAGAT GTGGTTCAAC ATTATTGGAA TCAATTCTTA GTATGAATAA AGACGTATAT GATCTAGGTG AAATTAATAT TTTAGAAGAA TCATTTATAG AGTATAAGAA GTTTATACAA GATATTAGTC TTGCTGAACT TTATAATGAT AAAGTTAAAA ACAAGACTGA CTTGAATATC ACAACAAATA AATGGTTATA TAACTATCAA TATGCAGGTA TTATCGCTCG CCATATTCCA AACGCAAAAC TTATTCATTG TTTTCGACAT CCTTTAGACA ATATTCTTTC ACTTTACCGA ACACATTTTG CTAGTGGAAA TGAATATTCT TCATCCTTAG TTGATTGTGC ACGAGTTTAT TTAGATCAGG AGAAGGTAAT GAAAGAATAT AAAAATAGAT ATAGATCAAA AATTTACGAT TTAAATTATG ATTCATTAGT GAACGATCCT AATAAAGAAA TTAGATCGTT AATCTCTTGG TTGGGTTGGG AATGGCAAGA TTCATTTCTA TCTCCGCAAC TAAATTCACG CTCAGTTTCA ACAGCAAGCA ATGTTCAAGT TCGTTCCCCA ATTAATTCAA AATCTATTGG TGGATGGAAG AACTACAAGG ACATGCTTAA ACCTGCGATT GAAATACTTG CTAAAACCAA TGGATATAAA AACATCGCTT CCTAA
|
Protein sequence | MKGFGEQHKS KKKRNETTTF QVPFPFPLRE SQENINIITN TPSKLSKEQI INQALKFHSK GNISEATKYY QYFINQGFKD HRVFSNYGAI LRKQGKVKES GFFMRKAIEI QPNIPIINFN MGNILKDLGK LKEAEIFIRK SIRMDPNLTK AYLSLSTITY SNIDNKTWQD NLFSDFILKY KSQVEQVEIY FARANILHKE KKYKKSYQNL KLANNLQLAI KASDINKRIK KSQLLLIESD KKVVNKKEKI NYAQSIFILG LPRCGSTLLE SILSMNKDVY DLGEINILEE SFIEYKKFIQ DISLAELYND KVKNKTDLNI TTNKWLYNYQ YAGIIARHIP NAKLIHCFRH PLDNILSLYR THFASGNEYS SSLVDCARVY LDQEKVMKEY KNRYRSKIYD LNYDSLVNDP NKEIRSLISW LGWEWQDSFL SPQLNSRSVS TASNVQVRSP INSKSIGGWK NYKDMLKPAI EILAKTNGYK NIAS
|
| |