Gene Information |
Locus tag | NATL1_20341 |
Symbol | |
ID | 4779073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1677646 |
End bp | 1678959 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640085327 |
Product | hypothetical protein |
Protein accession | YP_001015854 |
Protein GI | 124026739 |
COG category | [S] Function unknown |
COG ID | [COG4370] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03492] conserved hypothetical protein |
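The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for NATL1_20341.
length_bp = gene_length(1677646, 1678959)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1314 437
```

Both results match the reported Gene Length (1314 bp) and Protein Length (437 aa).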
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.433263 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCAATCCT CTGCCTTACC ACTTGGCCAT GCCGCCGAAA ATGAAAGTTT CATTTCCAAT TCAAATCGTA TCAGTCAAGC CGAACAAAAT CCTTTGTTTC TTTGTAATGG ACATGGAGAA GACATGATTG CATGCAGGGT TATAGAAGCT CTCCATGAAA TGAATCCAAA TATCTCCCAA GAAGTTCTCC CAATGGTTGG AGATGGGAAA GCATTTTTAA AGCATGTAAA AAATGGTTGG CTTGCAAAGA TTGGCACCTC AATATTTTTG CCAAGCGGAG GATTTAGTAA TCAGAGTTTT AGTGGTTTGG TTTTAGATTT AAAAGCTGGA TTGTTAGGAA GTCTTTGGGT TCAATGGACT TTGACTCAGA GAGCAGCCAA AGAAGGAAAA ATTATCGTTG CAGTTGGAGA TTTATTACCT CTTCTTTTTG CATGGGCTAG TGGGGCAAAT TATTTTTTTA TTGGCACTCC TAAAAGTGAT TACACATGGG CCAGTGGTCC AAGATCCGCT TTAAGTGATT GTTACCATCG ATTGAAAGGA ACTGAGTGGG ATCCATGGGA ATATTGGTTA ATGCGATCTA GTCGATGCAA GATGGTTGCA GTAAGAGACA AAATCACTGC TAGAGGTTTG AGAAATCACG GTGTAAAGGC ACTGTCCCTG GGAAATCCAA TGATGGATGG AATTTCTAAA AGAGAATGTC CTAATGACTT TAAAAGATAT AGGCGTTTGA TTTTGTTATG TGGAAGTCGT TTGCCTGAGG CGTATCAGAA TTTTAAAAAA CTTTTAATTG CAATCCAGTT TATTCGAATT AAATCTTCCA TTGCAGTCTT TGTTCCTTTA AGTTCTTCTG CAATGAGAAA AAAAATAGCA TTAATATTGA TGGACTTAGG CTTTAAATCT ACTTATCAAT CAACAGGTCA AAATGGGATT TCAGAAATAT GGAAAAAAAA CTCATTACTT ATATTGATTG GCTTTAACCA ATTTTCTTAT TGGGCTAAGT GGGGAGAAGT AGGAGTAGCT AATGCAGGTA CAGCTACAGA ACAAGTAGTA GGTTTAGGAA TCCCATGCGT TTCCTTGCCA GCAAAAGGAC ACCAATTTAA TTTCAATTTT GCTAAGCGTC AAAGTCGTTT ATTAGGAGGA TCGGTGGCTA TTGCTAAGAG TTATGAAACT CTCGCAAAAC AAGTAGAGTT TTTACTGAAC TCTGATTTTG ATAGAGAGAT TATTGGCTTA AGAGGGGCTC AAAGAATGGG TCCAGAGGGC GGAAGTCACT CTATAGCACT TAGTATTTCG AATCACTTGT CCCAGGGCTC ATAG
|
Protein sequence | MQSSALPLGH AAENESFISN SNRISQAEQN PLFLCNGHGE DMIACRVIEA LHEMNPNISQ EVLPMVGDGK AFLKHVKNGW LAKIGTSIFL PSGGFSNQSF SGLVLDLKAG LLGSLWVQWT LTQRAAKEGK IIVAVGDLLP LLFAWASGAN YFFIGTPKSD YTWASGPRSA LSDCYHRLKG TEWDPWEYWL MRSSRCKMVA VRDKITARGL RNHGVKALSL GNPMMDGISK RECPNDFKRY RRLILLCGSR LPEAYQNFKK LLIAIQFIRI KSSIAVFVPL SSSAMRKKIA LILMDLGFKS TYQSTGQNGI SEIWKKNSLL ILIGFNQFSY WAKWGEVGVA NAGTATEQVV GLGIPCVSLP AKGHQFNFNF AKRQSRLLGG SVAIAKSYET LAKQVEFLLN SDFDREIIGL RGAQRMGPEG GSHSIALSIS NHLSQGS
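The sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind summarized in the GC content field; the helper names are hypothetical, and the exact method the database uses to compute its 38% figure is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence in this record.
first_blocks = "TTGCAATCCT CTGCCTTACC"
print(clean_sequence(first_blocks))  # TTGCAATCCTCTGCCTTACC
print(gc_content(first_blocks))      # 0.5
```

Applying `gc_content` to the full Gene sequence field would give the gene-wide value to compare against the reported figure.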