Gene Information |
Locus tag | NATL1_15211 |
Symbol | |
ID | 4780995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1235703 |
End bp | 1237268 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640084803 |
Product | hypothetical protein |
Protein accession | YP_001015343 |
Protein GI | 124026227 |
COG category | [S] Function unknown |
COG ID | [COG1543] Uncharacterized conserved protein |
TIGRFAM ID | |
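The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 1235703, End bp 1237268.
length_bp = gene_length(1235703, 1237268)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the results agree with the Gene Length (1566 bp) and Protein Length (521 aa) fields in the record.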
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.270721 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACTAAAG GCAATCTCGC CATAGTCCTG CATGCCCATC TACCTTACGT GAGAGCAGAA GAACCTGGTT CATTAGAGGA GGACTGGTTC TTTCAAGCTT TAGCGGAATG TTATTTACCA CTTTTAGAAA CACTTGAAAA TGCCTCTAGA TCAAAAGATC AAGCACCAAA AATCACAATA GGATTATCTC CTACTTTGCT TTCGCTTTTG GGTGATGAAG TATTAAAAAA TCGATTTGAA GAATGGGTAA CTATAAGAAT AGAAGTTCTT AATACATTAG AAACAGACTG CATAAAAGCA GTTCTTCATC TTAAAGGCCA CTTAAAACGC CAACTAGAAA GCTGGAAGAT TTGTCAAGGA GACGTAATAG GAAGATTTAA AAAATTACAA ACATTGGAAG TGATTGATAT TCTTACTTGC GCAGCCACTC ATGGATATCT TCCTCTTTTA AGAGAAAATC CTGAAGCTGT AAGGGCTCAA TTAAAAACAG CAGTAACTGA ACATAAGCGT CTTTTCATGA GCGCGCCATT GGGTATTTGG TTGCCTGAAT GTGCCTATTA TGAAGGCTTG GATGAATTAA TGGCTGAATC AGGCTTGAGA TATGCAGTCC TGGATGGTCA TGGCTTGTTG AATGCAGATC CAAGGCCAAG ATATGGTCTG TATGCTCCTA TTTGTACAAG AAAGGGAGTA GCTTTTTTTG GTAGAGATAG TGAATCAACT CTTCCAGTAT GGTCAGCAAG GGACGGCTAT CCTGGCAATC CAAGCTACAG AGAATTTCAT AGGGACTTGG GCTGGGATTT ATCAATAGAG AACCTAAAAA AGATAGGAAT TAAGGGGAAA AGGCCCTTAG GAATCAAATT ATTTAAGATT ACCTCTCGAA ATACATCTTT AGAGAATAAA CAAGAGTATG ACCCAGAGGC GGCAAATGAG AGTGCTGAAA AAGATGCAGA TAATTATTTA AAGGAAAGGA AGAAACAACT TATAAAGCTT GAAAAATCAA TGCAAATAGA GCCATTATTA ATCGCTCCTT TTGATGCAGA ACTTTTTGGT CATTGGTGGT TTGAAGGCCC GAAATTCCTA TCCTATTTAT TTATCAAATC AAAAAAAGAG GGTATAAAAT TAATTACTTT AAAAGAGTCT CTTAAATTGA CACCCAAAAT CCAATTATGC AATCCTTCTC CATCTAGTTG GGGACAAGGT GGTTTTCATA ATTATTGGTT AAACAAATCA AATGCATGGA TTGTTAATGA ATGGAGTAAA GCTGGAAGAG CAATGGTAAG TATTTGTTCA GATAGTTTAA TAGAAGAATC AAATATCAAA ATTATTAATC AAGCTGGAAG AGAGCTTTTA CTATGTCAGT CTTCAGACTG GAGTTTCATT CTTAAGGCTG GAACTACTAC CGAGCTGGCA AGAGAAAGAA TAAATTTGCA TCTAAAAAGA TTTTGGATGT TAATAAATAC AATAAAAAAT AATAAAATTA TAAATGAAAA GATTCTAGAG GAAATAGAAA AAGAGGATTG TCTATTTCCT TTGATTTCTC TTATTGATTG GAAAAAGAAA AGCTAA
Protein sequence | MTKGNLAIVL HAHLPYVRAE EPGSLEEDWF FQALAECYLP LLETLENASR SKDQAPKITI GLSPTLLSLL GDEVLKNRFE EWVTIRIEVL NTLETDCIKA VLHLKGHLKR QLESWKICQG DVIGRFKKLQ TLEVIDILTC AATHGYLPLL RENPEAVRAQ LKTAVTEHKR LFMSAPLGIW LPECAYYEGL DELMAESGLR YAVLDGHGLL NADPRPRYGL YAPICTRKGV AFFGRDSEST LPVWSARDGY PGNPSYREFH RDLGWDLSIE NLKKIGIKGK RPLGIKLFKI TSRNTSLENK QEYDPEAANE SAEKDADNYL KERKKQLIKL EKSMQIEPLL IAPFDAELFG HWWFEGPKFL SYLFIKSKKE GIKLITLKES LKLTPKIQLC NPSPSSWGQG GFHNYWLNKS NAWIVNEWSK AGRAMVSICS DSLIEESNIK IINQAGRELL LCQSSDWSFI LKAGTTTELA RERINLHLKR FWMLINTIKN NKIINEKILE EIEKEDCLFP LISLIDWKKK S