Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_18421 |
Symbol | |
ID | 4780607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1504193 |
End bp | 1505185 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640085131 |
Product | hypothetical protein |
Protein accession | YP_001015662 |
Protein GI | 124026547 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.465045 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAAG ACATTCTGCT TTTAACTGCT CTCGTTGCAG TTATTTTAAT GGGTTCTGCA ATGTGCTCAG GGATTGAAGC AGCATTATTA GCAGTAAACC CATTACGCGT ACATGAGCTC GCAAGGAGAA AGCCCAAAGT ACTTGGAGCT AGAAGATTAG AAAAATTACG CCACAGAATT GGAAGGACTT TAACTGTAGT AACAATTGCA AATAACAGTT TCAATATTTT TGGAAGTTTG ATGGTTGGAA GCTACGCAAC TTACATATTT CAAGATCGAA TAGGAAATGT AAAATCCATA TTTTTTGTTG GCCTAACTAT TCTTGTTTTA CTTCTTGGAG AAATTGTTCC CAAAGCTCTC GGCACAAGAC TTGCATTGCA AATTAGTTTA ACAAGCGCTC CTGTCCTGGA TTTCTTAAGC ATAGTTATGC GTCCATTGCT AATAGTTCTT GAACGTCTAC TACCAATCAT CACTGCCAAG AGCGAGCTAA CAACAGACGA AGAAGAAATA AGACAGATGG CCAGACTTGG ATCTCAAATA GGTCAAATAG AAGCTGATGA GGCTGCAATG ATATCCAAAG TTTTCCAGCT AAATGACCTT ACTGCTAAAG ATTTGATGAC TCCACGTGTT GCCGCTCCAA CACTTCCAGG AAGAGTTTCT TTACAATCTG TCAAATCAAA CTTATTAGAA AATAATGCAA CATGGTGGGT AGTATTAGGT GAAGAAGTAG ACAAGGTTGT TGGAGTTGCT AACCGTGAAA AGTTATTAGC CTCTTTACTT CAAGGAAACT CCCATTTAAC TCCTTATGAT CTAAGCGAGA ATGTAGAGTT TGTACCCGAA ATGATTCGAG TAGATAGACT ACTTCTTGGT TTTAATGAAG ACAAAAATGG AGTTAGAGTT GTGGTAGATG AGTTTGGTGG ATTTGTTGGT TTAATAGGAG CAGAAGCTGT ATTGGCAGTT TTAGCTGGTT GGTGGAGGAA GTCAAATAAA TGA
|
Protein sequence | MSQDILLLTA LVAVILMGSA MCSGIEAALL AVNPLRVHEL ARRKPKVLGA RRLEKLRHRI GRTLTVVTIA NNSFNIFGSL MVGSYATYIF QDRIGNVKSI FFVGLTILVL LLGEIVPKAL GTRLALQISL TSAPVLDFLS IVMRPLLIVL ERLLPIITAK SELTTDEEEI RQMARLGSQI GQIEADEAAM ISKVFQLNDL TAKDLMTPRV AAPTLPGRVS LQSVKSNLLE NNATWWVVLG EEVDKVVGVA NREKLLASLL QGNSHLTPYD LSENVEFVPE MIRVDRLLLG FNEDKNGVRV VVDEFGGFVG LIGAEAVLAV LAGWWRKSNK
|
| |