Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_21041 |
Symbol | |
ID | 4781129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1748701 |
End bp | 1750359 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640085400 |
Product | hypothetical protein |
Protein accession | YP_001015924 |
Protein GI | 124026809 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.331475 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACAA ACAGTACTGC AATTTATTAT CAATCTGATG CCTATACAAC TAGCAAGCAG AAGCTCATGG GTCGTAATGC TGCTGGTGAA TCTTTTCTTA GGGCTTATTT TAAATATGAT AATAGTCATA ATCTTTATGT TTATCCTGTA TCTCTAGAGG ATTTAGATTG TTTTAAAAGG AAAGCCATTG CTTACAATCG TCATGAACCT ATTGAGTGTA TAACCAAGAG ATCTTTAACT AAGCTTGTTG ATGTTGGAAA TCTTTTTGTA CCTGGTCCAG GTCTTGATCA ATTTGCTCAT GAACGTTGTT TCTCTGGTCA TAATTCTTGG AGTCTTTGCG GTATCACACA TACAACCAGC AGTATTAACG CTATGGATTG TATTTCCTCT CTCGCCACTG CTCCTATTCA AGAATGGGAT GCTTTGATTT GTACTAGTAA CGCTGTTAAA AAACATGTTA ATGAAACTCT TAATTCCCAA TTTGAATATT TAAAATATCG TTTAGGAATT TCAAAGCTCG TATTACCCCA ATTACCCGTT ATTCCTCTTG GAATTCATAC CTCTGACTTT CATTTTACCG ATTCCGAAAA GTTTTCTTCT AGAAATACTT TGGGCATTGA TGATAATTCA ATTGTGATTT TATACACAGG TCGATTATCT TTTCATGCTA AGGCTCATCC CCTAGCGATG TATCAAGCTT TGGAGTTATC TTCAAAACAA ACAAATATTC CTATTGTGTT AATTGAATGT GGATGGCATG CGAATCAGTC AATAGCAGAT TCTTTCACTG AAGCAGCTCA AAGATTTTGC CCCTCGATAA AAGTACTTCA TTTAGATGGT CGCATTAATA AAAATCGTTC TTTGGCTTGG TCTTCTGCTG ATATCTTCTG TTCTTTGCCT GATAATATTC AAGAAACTTT TGGAATCGTT CCCATTGAGG CAATGGCTGC GGGTTTACCC GTAGTAGTAT CTGATTGGGA TGGCTATAAA GATACAGTTA GAGATGAAAT AGATGGTTTT AGAATTCCAA CGTTAATTCC CGAAGAAGGT CTTGGTGCGG ATCTTATGCA AAGATATTCT CTGGGTATTG ATACTTATGA TATGTATTGT GGTCATACCT CAAGTCTGAT TTCCGTTGAT GTTTTATCTG CTAATAGAGC ATTTACAAAA TTGATTCAAT CACCTTCTCT TCGAGTCAAG ATGGGTGCAT CTGGTCTTAA AAGAGCACGA GAAATGTATG ACTGGTCAGT TATTTATAAA CAGTATGATG ACCTTTTTAA TCATTTAAAC CTGATTAGAA AGAGCAGTGT ACTTAATGAT TTTGACAAAC AACGTTTTTG GCCCGGACGA GTTAATCCCT TTCAGGGTTT TTCAGATTAT GCTACCAATC AACTTTCATT AAATTCAAAA GTGAGTTTAG TTGATGATGA TTTTCAAATT ACATTTCAAC GATACATTGA TATTAAGGAT CTAAAAATGG TCTCTTTTGC TTCATATATT TTACCAACAC ATGAAGAAGT TAAATGTATA TTTAATAATC TTAGTAAAGC TCCAATGAAA GCTTGTGATC TATTAATTCC ATTTGAACTA AAAAGAAGAC CTTTCATCTT AAGAACTTTA GTCTCTTTAC TCAAGTTTAA TCTTATTAAA CTAGTCTAA
|
Protein sequence | MSTNSTAIYY QSDAYTTSKQ KLMGRNAAGE SFLRAYFKYD NSHNLYVYPV SLEDLDCFKR KAIAYNRHEP IECITKRSLT KLVDVGNLFV PGPGLDQFAH ERCFSGHNSW SLCGITHTTS SINAMDCISS LATAPIQEWD ALICTSNAVK KHVNETLNSQ FEYLKYRLGI SKLVLPQLPV IPLGIHTSDF HFTDSEKFSS RNTLGIDDNS IVILYTGRLS FHAKAHPLAM YQALELSSKQ TNIPIVLIEC GWHANQSIAD SFTEAAQRFC PSIKVLHLDG RINKNRSLAW SSADIFCSLP DNIQETFGIV PIEAMAAGLP VVVSDWDGYK DTVRDEIDGF RIPTLIPEEG LGADLMQRYS LGIDTYDMYC GHTSSLISVD VLSANRAFTK LIQSPSLRVK MGASGLKRAR EMYDWSVIYK QYDDLFNHLN LIRKSSVLND FDKQRFWPGR VNPFQGFSDY ATNQLSLNSK VSLVDDDFQI TFQRYIDIKD LKMVSFASYI LPTHEEVKCI FNNLSKAPMK ACDLLIPFEL KRRPFILRTL VSLLKFNLIK LV
|
| |