Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_07101 |
Symbol | |
ID | 4780085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 653007 |
End bp | 654020 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640083984 |
Product | hypothetical protein |
Protein accession | YP_001014533 |
Protein GI | 124025417 |
COG category | [S] Function unknown |
COG ID | [COG2138] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.129916 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.330282 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATTTCCC CTGCATCAAT TAACTATAAA TATCCATCTA ATATTGGAAT TTTGTTATGC GGACATGGTA GTAGAGATCC CCAAGCAGTA AAAGAGTTTA TAAATGTAGT AAATAAAATA AAATCTAGAA TACCTGATAT CCCAGTTGAA TTCGGTTTTC TAGAATTTAA TCGACCAATA ATTAGTGAGG CCCTAGATCA GCTTAGGGAT TTGGGAGTTG AGAGAGTGAT TGCTTTACCA GCTATGTTAT TTGCTGCCGG GCATACTAAA AATGATATCC CTGCGGTTTT GAATAAATAC TCCGCTGATA ATGGACTTCT AATTCAATAT GGCAGGGAGC TTGGTTTGAA TTCTTTGATG ATTGGAGCAG CAGGAGCAAG AATCAAAGAA ACAATTGATA GTAATCCAAT ATTTCCTCTT CATGAAACAT TACTTGTCGT CGCAGGTAGG GGATCGTCCG ATCCAGATGC TAATTCAAAT GTATGTAAGA TTACAAGGAT GCTTGTTGAG GGATTTGGTT TTGGATGGGG AGAAACTGTT TTTTCAGGAG TAACATTTCC CCTTGTTGAT CCTGGCTTGA GACATGCTCT CAAATTAGGT TTTAAAAGAG TAATTCTCTT ACCTTATTTT CTTTTTTCTG GAGTTTTAGT CAGTCGCGTT AGAGAACATT CTACGAGAGT TGCAAATGAT AATCCTGATG TGAAGTTTCT AAACGCAAGT TACTTATCAG ACCAAGATTT AGTCATTGAT ACTTTTATGG AAAGAATTCA AGAAGTTTTT GATGGTGAGA ATTTTATGAA CTGTGCTTTA TGTAAATATC GTTCTAATTT ATTAGGTTTT GAAAGCGAGG TTGGATATGA GCAGATCAGT CATCATGATC ATGTTGAAGG TTGTCTAGAC ATTCGCCGAG AAAACAAAGA GCATAATCAC GAGCATGAAC ATTTTCCTTA TCCACATGCA AAGCATCCTT TAGGACCTGT CACGCTTCCC TCTTTAAACA AAAGCCAAAT CTAA
|
Protein sequence | MISPASINYK YPSNIGILLC GHGSRDPQAV KEFINVVNKI KSRIPDIPVE FGFLEFNRPI ISEALDQLRD LGVERVIALP AMLFAAGHTK NDIPAVLNKY SADNGLLIQY GRELGLNSLM IGAAGARIKE TIDSNPIFPL HETLLVVAGR GSSDPDANSN VCKITRMLVE GFGFGWGETV FSGVTFPLVD PGLRHALKLG FKRVILLPYF LFSGVLVSRV REHSTRVAND NPDVKFLNAS YLSDQDLVID TFMERIQEVF DGENFMNCAL CKYRSNLLGF ESEVGYEQIS HHDHVEGCLD IRRENKEHNH EHEHFPYPHA KHPLGPVTLP SLNKSQI
|
| |