Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_02451 |
Symbol | |
ID | 4779607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 225846 |
End bp | 226994 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640083510 |
Product | NifS-like aminotransferase class-V |
Protein accession | YP_001014074 |
Protein GI | 124024958 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAATA ATCACCTAAT TTTTGATTTT CAGTCTTCAA CACCATGTTG TACGAAGGTT GTTGAGGAGA TGGCTCCTTA TTGGAATGAG TTGTGGGGTA ATCCCTCTAA TACTAATAAT CGATCCGGTG TTTTTGCTTC AGCTGCAGTT GAAGTATCTC GTGAAAAAAT AGCTTCATAT TTGAATATCA ATCCGAAAAG ATTAATTTTT ACGAGTGGGG CAACAGAAGC GAATAATTTG GGTTTAGTTG GACATGCAAG AGCTAAAGCA CAACTGATTG GAAAACCTGG ACACATAATT ACCGTTTCAA CTGAACATCA TGCAGTCCTT GATCCGCTTA GGCAGCTTCA AAGAGAAGGG TTTCGCTTGA CAGAATTGCA ACCGAATAAA GAGGGTTTAA TCAATATTGA ACAACTTTCT GAAGCTTTTG AAAAAGATAC TTTTCTGGTT AGCGTCATGG CTGCAAATAA TGAGATAGGG GTTTTGCAAC CTATTGGTGA CATTGGGTCT TTTTGTAAGA GAAAGGGAAT CGCTTTTCAT TCTGATGCCG CTCAAGCCTT TGGATACTTA GATTTAGACC CCGATAAATT TCGCATTGAT TTGATGAGTT TGAGTGCTCA CAAAATCTAT GGACCTAAAG GTATTGGGGC TTTGGTGATT AGGGAAGGAT TCCCTCTTGA ACCCTCTCAA TATGGAGGAG GTCAGGAGCT TGGATTAAGA TCTGGCACGC TCCCTGTCCC GTTAATTGTT GGCTTTGCAA AGGCAGTCGA AATAACAAAA AATGATCAGG ATGAGAGGAA CAAGAGACTT TTGTTTTTTA GAAACTTATT GTTGAGTGGA TTAAAAAAGA ATATTTCTGG ATTGATAGTT AATGGATCTA TTGATCAAAG ATTGCCTCAT AATCTCAATA TCACTTTCCC TGGCGTGAAG GGAAGTCAAT TGCATGGTCA ATTAAGAAGG TTTATTTTTT GTACTAGTGG TTCAGCTTGT AGTAATGGTG AAGCTTCTCA TGTTCTGCAG GAGATAGGGC TCAGCAAAAA AGATGCTGAG GCGTCAATAA GGATGAGTAT TGGGAGAAAT ACTACGGAGA AAGATATACA TAAGGCTATT AATATTATTA CGAATATAGT GATTAATCTT AGACAATAA
|
Protein sequence | MDNNHLIFDF QSSTPCCTKV VEEMAPYWNE LWGNPSNTNN RSGVFASAAV EVSREKIASY LNINPKRLIF TSGATEANNL GLVGHARAKA QLIGKPGHII TVSTEHHAVL DPLRQLQREG FRLTELQPNK EGLINIEQLS EAFEKDTFLV SVMAANNEIG VLQPIGDIGS FCKRKGIAFH SDAAQAFGYL DLDPDKFRID LMSLSAHKIY GPKGIGALVI REGFPLEPSQ YGGGQELGLR SGTLPVPLIV GFAKAVEITK NDQDERNKRL LFFRNLLLSG LKKNISGLIV NGSIDQRLPH NLNITFPGVK GSQLHGQLRR FIFCTSGSAC SNGEASHVLQ EIGLSKKDAE ASIRMSIGRN TTEKDIHKAI NIITNIVINL RQ
|
| |