Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_19301 |
Symbol | |
ID | 4779584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1586620 |
End bp | 1587630 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640085220 |
Product | hypothetical protein |
Protein accession | YP_001015750 |
Protein GI | 124026635 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.665373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTTT TTCAGAGTTT GCTGGTAGCT CCAGCAGCTT TGGGCCTAAT GGCTCCAATT GCAGTTAATG CAGATACTGC ATTTTCATCA ACAACATCTC TTTCTGGTGG TGCTGTTTTT ACTATCGGTT CTGTTGCCGA TGGAGGAACA TCTGATACAG AAGAAGAGCT ATATATGCAA TACGGATATG GACTGGACGT AACTTCCAGT TTTACCGGTG AAGATATGCT CTACATGGGT ATGGAAACTG GTAATGCTAG TGGTCCATTA GCAAGCATGG ATAGCTCAAC CGGCGGTACT GGTGCTATTA CATTGCATTC TTTGTACTAT GCCTTCCCTT TGGGCGATCT TTCAGTAACT GCTGGACCGT TACTTGATCA GGATGATGTC GTTGCTGCAA CAACATCTGC TTACTCAGAT GCTTTCAGAC TAGGCAGCAT GCCTTACTCA TTGGCTGGTG GTGAAACTGG TCCTGGAGTT GGTGTTGCTT ACTCTGGAGA CAACGGCGTA GTTGCTTCTG TTAGCTTCGT TTCTGTTGGA GGCTCTGATT CAACAGTAGG AATCGGCGCT GATGATGGAG ATGATGTTTC TACATTCACT CTTGGTTATG ACGGCGACGG CTTTGGTGGC GGACTTGTAA TCGCTACTAA CGACGGTGAA GCTGGAACAA GTGGATACGA CACATTTGGT GGCGGTATCT ATTACAGCCC TGAGTCAGTT CCTGCGACAA TAAGCGTTGC TTACGACACA ACAGATCCAG AGACAGGTGC TGATGCAACT GACTTGTTCG TTGGTGTTGA CTACGAAGTT GGCCCTGGAA CATTAAGTGC TGCTTACAAT TCAACTGATA TTGATGGCAG TGATTCTGAA GACTCAACAG GATTTGAAGT TTCTTACAGC TACGGACTTA ATGACTATGT CTCAGTAACA GCTGGATTCT TCACTGTTGA AGATACAAGT ACTGGTGATG ACGATACTGG TGTAGTTGCT GAAACCTATT TCAGCTTCTA A
|
Protein sequence | MKFFQSLLVA PAALGLMAPI AVNADTAFSS TTSLSGGAVF TIGSVADGGT SDTEEELYMQ YGYGLDVTSS FTGEDMLYMG METGNASGPL ASMDSSTGGT GAITLHSLYY AFPLGDLSVT AGPLLDQDDV VAATTSAYSD AFRLGSMPYS LAGGETGPGV GVAYSGDNGV VASVSFVSVG GSDSTVGIGA DDGDDVSTFT LGYDGDGFGG GLVIATNDGE AGTSGYDTFG GGIYYSPESV PATISVAYDT TDPETGADAT DLFVGVDYEV GPGTLSAAYN STDIDGSDSE DSTGFEVSYS YGLNDYVSVT AGFFTVEDTS TGDDDTGVVA ETYFSF
|
| |