Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_20461 |
Symbol | |
ID | 4779901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1688949 |
End bp | 1690301 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640085340 |
Product | hypothetical protein |
Protein accession | YP_001015866 |
Protein GI | 124026751 |
COG category | [S] Function unknown |
COG ID | [COG3395] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAA TTATTTTTGA TGATGATCCA ACTGGATCTC AAACTGTTTA TGGATGCCCA TTATTATTAA ATTGGGATGA ACAAAATCTA GAGAAAGCAT TTAAACAGTC TTCTCCTTTA ATATTTATAC TTGCGAACAC AAGATCTTTA TCTTCTGTGT TGGCTGCTAA GAAAACCAGA GAAATATGCT CATCTATTAA GAAATTTTTT GTAAGACAAG GATATTCCAA AAATGATTTC TTTTATATTA GTAGAGGTGA TTCAACCTTA CGTGGTCATG GTGTTTTAGA ACCTGCAATT CTTGCGGAAG AATTAGGACC ATTTGATGCA ACATTTCACA TTCCAGCTTT TTTGGAAGGT GGAAGAACAA CTGAAAACGG TATTCATTAT TTAAATGGTA TACCTGTTCA TACTACTGAT TTTGGTCATG ATAATATTTT TGGATTTTCT ACTAGTAATT TAGCTAAATG GATTGAAGAA AAAAGTTTTG GAAAGATTAA GGCGAAAAAT ATCTTGCACG TTAAAATTAA ACAATTAGAC ATGGCTTTTA ATGATGAAGA TGGTTTTGAA TCTCTTTTAA AATTTTTATC TAAATTAGAA AATAATACTT CAGTTGTTGT GGATGCTAAA TTACCTCATC ATCTAGAAAT ACTAGCTAGT GCGATCAAAG TAGTTTCTAA AGAAAAAAGA TTTCTTTTTA GGACTGCAGC AAGCTTTATA AAATCTTTAT CTGCATTGCC ACCTAACCCT AAATGTACTG CAGATTTAGT TTCTTTGAAA TCGAAAAATA ATGAGTATAA ATATAAGCCA GGTTTGATAA TAGTTGGCTC CCATGTGAAA TTAGCGACAG ATCAATTAGA GGTTTTAATG ATGGATAATT CCTGTAAAGG GTTGGAAATA CCAGTCAGTA AATTAGCTAA TATATTTGCT TTGGAAGATC GTAAACAGGC AATCTTAGAA ATTGAGTATA CTTTATTATC GAAAATAGAT GATATTTTGG ATTTAGAAAA AGTACCTGTT TTATATACTT CTCGAGAAGA AAGGAAGTTT TCTTCTTACT CTGAAAGGAT GACTTTTGGA CTTGAACTTG CAGAGTTTAT GTCAATTTTA GTTAGAAAAA TCACTAATAA ATTGGGTTAT ATTATTAGTA AAGGAGGTAT TACAACACAA ATCTTACTTC AGAAAGGGTT CAATTTTAAT CATGTACATT TAAAAGGACA AATATTACCA GGGTTGTCAA TAGTAACAGG TGATTCTGAT CAATATAATT TACCAGTAGT TACTTTCCCA GGCAATCTTG GAAATGAGAA AACACTACTA GAAGTATTAA AATTGATGGA TTCAATTTCT TAA
|
Protein sequence | MKIIIFDDDP TGSQTVYGCP LLLNWDEQNL EKAFKQSSPL IFILANTRSL SSVLAAKKTR EICSSIKKFF VRQGYSKNDF FYISRGDSTL RGHGVLEPAI LAEELGPFDA TFHIPAFLEG GRTTENGIHY LNGIPVHTTD FGHDNIFGFS TSNLAKWIEE KSFGKIKAKN ILHVKIKQLD MAFNDEDGFE SLLKFLSKLE NNTSVVVDAK LPHHLEILAS AIKVVSKEKR FLFRTAASFI KSLSALPPNP KCTADLVSLK SKNNEYKYKP GLIIVGSHVK LATDQLEVLM MDNSCKGLEI PVSKLANIFA LEDRKQAILE IEYTLLSKID DILDLEKVPV LYTSREERKF SSYSERMTFG LELAEFMSIL VRKITNKLGY IISKGGITTQ ILLQKGFNFN HVHLKGQILP GLSIVTGDSD QYNLPVVTFP GNLGNEKTLL EVLKLMDSIS
|
| |