Gene Information |
Locus tag | NATL1_20911 |
Symbol | |
ID | 4779113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1733021 |
End bp | 1734010 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640085387 |
Product | putative transcriptional regulator |
Protein accession | YP_001015911 |
Protein GI | 124026796 |
COG category | [K] Transcription |
COG ID | [COG1725] Predicted transcriptional regulators |
TIGRFAM ID | |
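The coordinate and length fields above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that Protein Length excludes the terminating stop codon. The function and variable names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 1733021
END_BP = 1734010


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 990
print(protein_length(length_bp))  # 329
```

Both results match the Gene Length (990 bp) and Protein Length (329 aa) fields.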
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.605524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.66693 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAGATTCC ACATACAACA GGACAGTGAA ATCCCTGCAT CAAACCAGTT ATATAATCAA GTTTGTTTTG CTATTGCCGC AAGACATTAC CCACCTGGAC ATAGGTTGCC CAGTACCAGA CAACTTGCAA TGCAAACTGG TTTACATCGA AATACTATAA GCAAAGTCTA TAGACAGCTA GAAACAGATG GAGTTGTTGA AGCTATTGCT GGTTCAGGTA TTTATGTTCG GGATCAGCAA AAACAAAAAG ATTTAAGAAG TAGCTCTCCT TCACTACGTA AAAAAGGCAT CAAAGATATA GACCATGAGA TCCGTAAAAG TATTGATGAA CTTTTAAATG CTGGATGCAC CTTGCAGCAA ACGAGAGAAT TATTTACCCG TGAAATTGAT TGGAGACTGA GATGCGGAGC TCGCTTACTT GTTAGTACGC CTAGAGAAGA TATTGGAGCA TCATTACTAA TTGCTGAAGA ATTGGCTCCT CATTTAGATG TCCCAGTAGA GGTTGTTCCA ATGGAAGAAC TAGAGAGTGT TTTAGAAAGC TCAAGCAAAG GAACAGTCGT GACAAGTAGA TATTTCTTAC AGCCTTTAGA AGAATTAGCC AAGCGACACA AAGTAAGAGC CGTAGCTGTA GATTTAAACG ATTTTAAACA AGAACTCAAC ATTCTAAAAA AGCTACGCAC AGGAAGTTGT GTAGGGATAG TAAGTATCAG TCCAGGAATA TTAAGGGCAG CAGAAGTTAT TTCACACAGT ATGCGCGGCA ATGAACTCCT TTTAATGACA GCAAATCCCG ATGTAGGAAG TCGTCTTATT GCCTTATTAA GAGCAGCTAG TCACATAATT TGTGATAGCC CTAGCTTGCC TGTTATCGAA CATACTTTGA GACAAAATAG AACCCAATTC ATGAGAATGC CACAAATTCA TTGTGCGCAA AAGTATCTGA GCGACTCTAC AATTGAAGAG TTGAGTAAAG AAATTGGCCT TCTCGAATAA
Protein sequence | MRFHIQQDSE IPASNQLYNQ VCFAIAARHY PPGHRLPSTR QLAMQTGLHR NTISKVYRQL ETDGVVEAIA GSGIYVRDQQ KQKDLRSSSP SLRKKGIKDI DHEIRKSIDE LLNAGCTLQQ TRELFTREID WRLRCGARLL VSTPREDIGA SLLIAEELAP HLDVPVEVVP MEELESVLES SSKGTVVTSR YFLQPLEELA KRHKVRAVAV DLNDFKQELN ILKKLRTGSC VGIVSISPGI LRAAEVISHS MRGNELLLMT ANPDVGSRLI ALLRAASHII CDSPSLPVIE HTLRQNRTQF MRMPQIHCAQ KYLSDSTIEE LSKEIGLLE
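The gene sequence begins with GTG, yet the protein begins with M. This is expected under translation table 11, where GTG can act as an alternative start codon and is read as methionine in the initiator position. The following is a minimal sketch of that translation, not the database's own pipeline. It builds the codon table by hand (table 11 uses the standard amino-acid assignments), and the helper names `clean`, `gc_percent`, and `translate`, as well as the `START_CODONS` set, are assumptions introduced for illustration. Only the first and last few codons of the record are used as input.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments; it differs in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed set of table 11 initiator codons; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the 10-base spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str, first_is_start: bool = True) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        residues.append(aa)
    return "".join(residues)


# First seven codons and last six codons of the gene sequence above.
head = "GTGAGATTCC ACATACAACA G"
tail = "ATTGGCCTTCTCGAATAA"

print(translate(head))                        # MRFHIQQ
print(translate(tail, first_is_start=False))  # IGLLE
```

The two outputs match the start (MRFHIQQ) and end (IGLLE) of the protein sequence in the record. Passing the full gene sequence to `gc_percent` would allow a comparison with the reported GC content.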