Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_07411 |
Symbol | |
ID | 4779083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 682075 |
End bp | 683277 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640084016 |
Product | hypothetical protein |
Protein accession | YP_001014564 |
Protein GI | 124025448 |
COG category | [S] Function unknown |
COG ID | [COG1565] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0067481 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTGACC CGATCGCGAG ATGTCCTAAA TGGTTAATTG ATCGCATTGG AGATAGTGGC GGTTCAATTA GTTTCTACAG ATATATGGAT TTAGTTTTAA ATGATCCAGA TAATGGATTT TATTCAACTG GAAAATTGAA TATTGGTAAG AATGGAGACT TTTGTACTTC ACCTTCTTTG AGTAATGATT TTGCACGTTT ATTAGCTATT CAAGTGGTTG ATTGGCTTCT TGATCTGGAA AAATCAGGAA TTGATTCCAA ATTGTTGTCT CTTATTGAGA TTGGCCCAGG AGAAGGAACT TTATCAAGAG ATTTGATCTT GGCTATCGCT GAAATCGCAC CTGCTTTAAT CTGTAAAATT GAGCTTGTAT TAGTTGAATT AAATGTAGGC ATGAGAAGAC GACAAGAAAA AGTAGTTAAT AATTTGGAGG GGATAAATTG TCGCTGGAGC AGTATCGAAG ATCTCATCTT AAGACCAGTT AATGGTGTAG TTATTGCTAA TGAAGTTTTG GATGCATTCC CAGTAGAAAG ATTGGTTTTT AGTGACAATA AAGTTTTTAG GCAGGGAGTT GGTTTGAAAA AAATAAATGA TGAAAATTAT TTGGAGTTTG TTGACCTCAA GCCTACTTCG AAGATTATTA AATTTTTGAA AGAATCTAAT AGCCTTTTAA AAATTGAGTT TCCACCAAAG GATATTTGTA ATAGATGGGT CACCGAATGG CATTGTGATG TCCCGAGTTG GTTTGGGAAT TTGTCTAAGG TTTTAATTGA TGGCGCATTA TTAGTTGTCG ACTATGCGAT GGAATCGAAG CGTTACTACA ACGCAATGAG ACAAGAAGGT ACTCTTATTT CCTATAGAAA TCATGTGGCA AACCCTAATG TTTTAAAAGA TGCTGGCTTG TGTGATTTAA CAGCACATCT GTGCATCGAA TCAACCATTA ATTATGCTCT GTTTAACGGA TGGAAGTTTA TGGGGGAAAC TAGGCAGGGA CAAGCTCTCT TGGCATTAGG ACTTTCAAAT TTTCTTTATT CTCTTCAAAA TAATAGTAAT AATGATCTCT CAGCCGCATT AAATCGTAGA GAGTCATTAT TGAGGCTAGT TGATCCAATT GGACTAGGTG ACTTTAGGTG GTTGGCTTTT CAGAAGGATA ATAGTGATGA TTTGATTTTA AGAAACCGTT TTCTTGAAGA GCCAATTAGC TAA
|
Protein sequence | MIDPIARCPK WLIDRIGDSG GSISFYRYMD LVLNDPDNGF YSTGKLNIGK NGDFCTSPSL SNDFARLLAI QVVDWLLDLE KSGIDSKLLS LIEIGPGEGT LSRDLILAIA EIAPALICKI ELVLVELNVG MRRRQEKVVN NLEGINCRWS SIEDLILRPV NGVVIANEVL DAFPVERLVF SDNKVFRQGV GLKKINDENY LEFVDLKPTS KIIKFLKESN SLLKIEFPPK DICNRWVTEW HCDVPSWFGN LSKVLIDGAL LVVDYAMESK RYYNAMRQEG TLISYRNHVA NPNVLKDAGL CDLTAHLCIE STINYALFNG WKFMGETRQG QALLALGLSN FLYSLQNNSN NDLSAALNRR ESLLRLVDPI GLGDFRWLAF QKDNSDDLIL RNRFLEEPIS
|
| |