Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_00731 |
Symbol | |
ID | 4780807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 77837 |
End bp | 80287 |
Gene Length | 2451 bp |
Protein Length | 816 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640083336 |
Product | hypothetical protein |
Protein accession | YP_001013902 |
Protein GI | 124024786 |
COG category | [N] Cell motility [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.456129 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGAGGGC AAGAAGAAAA GAAAAAAGTC ACGGAAGTGA AAACATTCCC AGTGCCATTT GCTTTAGGAG AAATTAAAGA AAATATTACT ATTAATACCA ATATTTCTTC TAAACTCTCA GAACTTTCTA AAGGACAAAT AATTAATCAA GCAATTCAGT TTCATGCACA AGGAAATATC CAAAAAGCAG CAAAATATTA TCAATATTTT ATTGATCAAG GTTTCAAAGA TCCTAGAGTT CTTGCTAATT ATGGAGTCAT ATTGAAAGGT TTTGGTAATT CACAGGAAGC AGAATTGTTA TACCGCAAAG CAATTGAACT GAATCCTAAT TTTGCAGACG CGCATTACAA TCTAGGAAAC ACATTGAGAG ATCTTGGTAA ATTAAAAGAA GCAGAATTGT CATACCGTAA AGCTATTGAA ATTAGTCCTA ATTACGCAAA TACACTTTAC AACCTTGGAA CTATATTGAG TGATCTTGGT AAATTGCAAG ATGCAGAATT TTCATACCGC CAAGCTATTA TAATTAATCC TAATTATACA GAAGCTCATT ACAATCTAGG AAACACATTG AGAGATCTTG GCAAATTAAA AGATGCTGAA TTATCATACC GTAAAGCTAT AAAAATTAGC CCTAATTACG CAAAAGTACA TTGCAATCTG GGAACAATAT TGAGAGATCT TGGTAAATTA AAAGATGCTG AATTATATAC CCGTAAAGCA ATTCAGCTGA ATCCTGATTT CGCAGAAGCG TATTCTAACC TAGGAAATAT ATTGAGTGAT CTTGGTAATT TAAAAGAAGC AGAAATCTCA CAAAAAAAAA CAATTGAACT TAAACCTGAT TGCGCAGAAG CGCATTCCAA TCTTGGTAAC ATATTGAGAG ATCTTGGTAA ATTAAAAGAT GCAGAGTTGT CGTACCGCAA AGCTATTGAA ATTAGTCCTA ATTACGCAAA TGCGCATTCC AATCTTGGCA ACATATTGAG AGATCTTGGT AAATTAAAAG GTGCAGAGTT GTCGTACCGC AAAGCTATTG AAATTAGTCC TAATTACGCA AACGCACATT ACAACCTTGG AAATATATTG AAAGATATTG GTAATTTCGG TGATGCTCTA AAGCAATTCA AGCAGGCACT AAAATTAAAC AATGAATTAT CATTAGCGAA GTATGCTTTA ATTATAACCA AAGGCAAGAT ATGCGATTGG AGCGATGAAG TTACTCATAA TATATGGCTT AAATCACTTG GGATCCAAGG GAAATCTATT GAACCATTGG GATTATTTCC ATTAGAAGAT AATCCGTCAA ACCATTTAAA AAGATCAAAA AATTATTATA TAGAAAACTT TACTCGACCA AGTAAATGTA TTCAATCTTA CGAGAAAAAT ATAATACATA TTGGTTATTT CTCTGCTGAT TTTCGGACCC ATCCTGTAAT GCAAATGCTT GCTCCTCTAA TAGAGATACA TGATAAGTCT AGATTTAAAA TATATTTATA TTCATTAGCA AAAAAAGAAG ATGAGTATAC TGAAAGAGCA AAAATGTCTG GATGTATATT TAAAAATATA ACAGAATTAA ATGATATTGA AGCAGTTGAA TTAGCGAGAA GTGACAAATT AGATATTGCC GTAGATCTAA TGGGATATAC TCGAAATAAT AGAATGCCTA TATTCTCATA TAGGGTTGCA CCAATACAAA TCAATTATTT AGGTTTTCCT GGCTCAACTG GCTCTGATAC TATTGATTAT ATTATCGGCG ATAATATAAC TATACCTAGA GAAAATGAGA AATTTTATAC TGAAAAAATA ATAAGAATGC CAAATTGCTT TCTATGTGAT GATAATAAAA AAGAAATTTC TAAAGAGTCA ATATGCCGTA AAGATTTTAA TCTTCCTGAT CAAGGCTTTA TATTTACTTG TTTTAATGAA AATTATAAAA TAACAAAAAA AGAATTTAAC ATATGGATGA ACTTACTTAT AAAAATAGAA GGGAGTGTTC TTTGGTTATA TAAGTCAAAT CAATGCTCAA TGAATAATTT ATATAAGGAA GCTAGAAAAC GTAAAGTTAA TCCCGACAGA TTAATATTTG CTGAGAGATT AGCAATGAAC AAACATCTTC CTAGGCATTC TCTAGGTGAT TTAGCACTTG ATACTTTTAA TTACAATGGA GGTGCGACAA CTTCTTGTGC ATTATTGGCT GGGCTACCAG TCCTGACCAA GATAGGTCAG AGTTTTATGG CTAGAGTATC TGCCAGTCTC CTTTCTTCGA TAGGACTTTC CGAATTAATT ACTTATAGTG AAAGTGAATA CGAAGAAAAA GCATTATATA TTGCTAATAA CCCTAAAGAA ATTCTTAGAT TGAAATCTAA ATTGAACAAA CTAAAAGAAA CTTCAACACT TTTTAATTCA GAATTATTTA CAAGAGATTT AGAGAGTAAA TTTATAGAAT TAGTGAAGTA A
|
Protein sequence | MGGQEEKKKV TEVKTFPVPF ALGEIKENIT INTNISSKLS ELSKGQIINQ AIQFHAQGNI QKAAKYYQYF IDQGFKDPRV LANYGVILKG FGNSQEAELL YRKAIELNPN FADAHYNLGN TLRDLGKLKE AELSYRKAIE ISPNYANTLY NLGTILSDLG KLQDAEFSYR QAIIINPNYT EAHYNLGNTL RDLGKLKDAE LSYRKAIKIS PNYAKVHCNL GTILRDLGKL KDAELYTRKA IQLNPDFAEA YSNLGNILSD LGNLKEAEIS QKKTIELKPD CAEAHSNLGN ILRDLGKLKD AELSYRKAIE ISPNYANAHS NLGNILRDLG KLKGAELSYR KAIEISPNYA NAHYNLGNIL KDIGNFGDAL KQFKQALKLN NELSLAKYAL IITKGKICDW SDEVTHNIWL KSLGIQGKSI EPLGLFPLED NPSNHLKRSK NYYIENFTRP SKCIQSYEKN IIHIGYFSAD FRTHPVMQML APLIEIHDKS RFKIYLYSLA KKEDEYTERA KMSGCIFKNI TELNDIEAVE LARSDKLDIA VDLMGYTRNN RMPIFSYRVA PIQINYLGFP GSTGSDTIDY IIGDNITIPR ENEKFYTEKI IRMPNCFLCD DNKKEISKES ICRKDFNLPD QGFIFTCFNE NYKITKKEFN IWMNLLIKIE GSVLWLYKSN QCSMNNLYKE ARKRKVNPDR LIFAERLAMN KHLPRHSLGD LALDTFNYNG GATTSCALLA GLPVLTKIGQ SFMARVSASL LSSIGLSELI TYSESEYEEK ALYIANNPKE ILRLKSKLNK LKETSTLFNS ELFTRDLESK FIELVK
|
| |