Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_10751 |
Symbol | |
ID | 4781013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 989843 |
End bp | 990880 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640084354 |
Product | hypothetical protein |
Protein accession | YP_001014898 |
Protein GI | 124025782 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0702] Predicted nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0490933 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0263054 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCATAAGC AAGTAGAAAT TTTGGAGTCA GATGCAAAGG AAAAGCTTGT TGTAGCAGTG ACTATGGCCA CAGGCAGGCA AGGTATTGGT GTCGTTAAAG AATTAAGCCA AACAGATAAA TACCAAATAC GTGCTATCAC AAGAAATATA AAGAGTACAA AGGCTCTAGA GCTAGGAAGC CTAAACAACG TTGAACTAGT CAAGGGAGAT TTAATGGATC CTGAAAGCCT TAAAAAAGCT TTTGAAGGAG TTGATGTGAT TTTTGGAAAT ACAACACCTA CAAAAGGATG GAAATTATTT AGAGGAAGTA TCGTCAGATC TTATGAAATG GAACAAGGTT ATAACTTAAT AAATCAAGTC AAAACTGCCT ACGAAAAAGG ATGTCTAAAT CACTTTATAT TTAGCTCAAT TAGTAAAGCA AAAGACCCAC TAAAAAATGA TCCTGCTCCA GGACATTTTA CGAGTAAATG GGATATTGAA GAATATATAG AAAAATCAGG TCTTAAAAAA ATTACTACTG TATTAAGACC CGTTAGCTAC TTTGAAAACT TTGAAAACAA ATTACCTGGC TATACAATTT CAAAGAAAAT TTTTCCAGGA ATAGTTGGCA AGAATTTTAA GTGGCAAACA ATCGCAGTAG AAGATGTAGG TAAATGGGTT AGAGGTGTTT TATCAAAATC AGAGAAATAT AAAAATCAAT CTATCAATAT TGCCGGCGAG GAACTAACAG GACTGGAAAT GGCTATGACA CTTCAAAGAA TAGTTTCTTC AGAAGGACTA AAAACAAATT ATGTGATGAT CCCTAGATTA GCAATTAAGT TATTGGAATA CGACATTGGC GTTATGGCAG ATTGGATTGA AAGATCAGGC TATGGAGCTG ATATGAATAA TCTTCAATCG ATTCAGGAAG AGTTAAATAT TGCTCCTACA TCACTTAAAG ACTGGCTAAA GACAAAACTT AAAAAACAAA CTAAGAAACA AAATTCATGG GCAAGGCAGT GGAAATCATC TCAGTGGAAA CTTCAATGGG ATAAATAA
|
Protein sequence | MHKQVEILES DAKEKLVVAV TMATGRQGIG VVKELSQTDK YQIRAITRNI KSTKALELGS LNNVELVKGD LMDPESLKKA FEGVDVIFGN TTPTKGWKLF RGSIVRSYEM EQGYNLINQV KTAYEKGCLN HFIFSSISKA KDPLKNDPAP GHFTSKWDIE EYIEKSGLKK ITTVLRPVSY FENFENKLPG YTISKKIFPG IVGKNFKWQT IAVEDVGKWV RGVLSKSEKY KNQSINIAGE ELTGLEMAMT LQRIVSSEGL KTNYVMIPRL AIKLLEYDIG VMADWIERSG YGADMNNLQS IQEELNIAPT SLKDWLKTKL KKQTKKQNSW ARQWKSSQWK LQWDK
|
| |