Gene Information |
Locus tag | NATL1_08711 |
Symbol | |
ID | 4779610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 807555 |
End bp | 808775 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640084146 |
Product | hypothetical protein |
Protein accession | YP_001014694 |
Protein GI | 124025578 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03573] N-acetyl sugar amidotransferase |
|
|
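The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that matches the listed Gene Length) and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag NATL1_08711.
length = gene_length(807555, 808775)
print(length)                  # 1221
print(protein_length(length))  # 406
```

Both results agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields in the record.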
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.484525 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000933108 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGAATCAC TTTATGGAAC ACCTGAAATA ATTTGTTTTT GCAAAAATTG CGTAATATCA AATCAAAGAC CTCGATCAAC AATTGAGTTT AAACATACAA GTTCTGACAA GAAACAAACT ATGGGCTTTG ATGAAAATGG TATTTGTGAT GCGTGTAAAT ACAGTCAAGT AAAGGCAAAC TTAATAGACT GGAAACTTAG AGAAAGCAAA TTAATTGAAT TATTAGATAA ATACAGAAAA AACAATGGTC AATATGATGT TGTAGTACCA GGCAGTGGAG GTAAGGACAG TGCTTTTACA TCTCACATAC TAAAATACAA ATATGGAATG AATCCGCTAA CTGTAACTTG GGCGCCACAT TTATACACAG ATATAGGATG GAAAAACTTC ACAAACTGGT CCCATATTGG CGGACACGAT AATTTATTAT TCACCCCTAA TGGAAAATTA CATAGACACC TTACTCGATT AGCATTTTTA AACTTATTAC ATCCTTTTCA GCCTTTTATT GTTGGTCAAA GAATCATTGG TCCATTACTA GCAGCCAAAC ATGGTATCAG TTTAGTAATG TATGGAGAAA ATCAAGCTGA ATATGGAAAT GATCCAAATG AAAACTTTGA TCCAACAATG GACCAAAAGT TCTTTACAGA GGGTAATCCT CTAGAGATGA AACTTGGTGG TATTTCAATC AAGGATATAA TTAAAAGTAG TGAATTCACG CTTCAAGATT TTTCACCTTA TATTCCTCCA TCTGCTGAAT TCTTAGAGAA ACAAAAGGTA GTGGTTCATT ACCTAGGATA TTATCTCGAA TGGGATCCAC AAGAATGTTA TTACTACGCT GTTGAAAATA CAGGCTTTGA GGCGAATACG GAAAGAACAC CTGGCACATA TTCTAAATAT TCCAGTATTG ATGACAAAAT TGATATGTTT CACTATTACA CAACATACAT AAAATTTGGT ATTGGTAGGG CTACCTATGA TGCATCACAA GAAATACGAA ATGGGAAAAT AACTAGAGAA GAGGGTGTGC ATTTAGTAAA TAAGTATGAT GCAGAATTTC CAGAAAAATA CTTCAAAGAT TTCTTAGAAT ATATTGATAT AGATCGAGAA AAATTTATAG AGACAGTAGA TAATGCCAGG TCACCTCACC TTTGGGAGAA AGTAAATAAT GAATGGAAAT TAAGGTATAA AGTTAGTAAT AAAAACCATT TTGATTCGTA G
|
Protein sequence | MESLYGTPEI ICFCKNCVIS NQRPRSTIEF KHTSSDKKQT MGFDENGICD ACKYSQVKAN LIDWKLRESK LIELLDKYRK NNGQYDVVVP GSGGKDSAFT SHILKYKYGM NPLTVTWAPH LYTDIGWKNF TNWSHIGGHD NLLFTPNGKL HRHLTRLAFL NLLHPFQPFI VGQRIIGPLL AAKHGISLVM YGENQAEYGN DPNENFDPTM DQKFFTEGNP LEMKLGGISI KDIIKSSEFT LQDFSPYIPP SAEFLEKQKV VVHYLGYYLE WDPQECYYYA VENTGFEANT ERTPGTYSKY SSIDDKIDMF HYYTTYIKFG IGRATYDASQ EIRNGKITRE EGVHLVNKYD AEFPEKYFKD FLEYIDIDRE KFIETVDNAR SPHLWEKVNN EWKLRYKVSN KNHFDS
|
| |
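The GC content field can in principle be recomputed from the Gene sequence row. The sketch below assumes the sequence is pasted as shown, in space-separated blocks of ten bases, and that GC content is simply the fraction of G and C among all bases. How the database rounds the value is not stated in the record, so the example only demonstrates the calculation on a short made-up string. The name `gc_percent` is a hypothetical helper.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Illustrative input only, not taken from the record: 4 of 8 bases are G or C.
print(gc_percent("ATGC ATGC"))  # 50.0
```

To compare against the record, pass the full Gene sequence string to `gc_percent` and round the result to the nearest whole percent.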