Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_09561 |
Symbol | |
ID | 4780931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 879852 |
End bp | 881501 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 640084233 |
Product | hypothetical protein |
Protein accession | YP_001014779 |
Protein GI | 124025663 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.241955 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.10067 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAATTC CTTTATCTAA TGTTATTAAC TATATGAAGA TTAAAGCAGT AGTTTTAAAT TTTTTCTTAA AGAACAAAAA AAATATTTTA CCATTCTTTG CCGTTCTTCT TTTTATATCA CTTTATTTCC TTATCGATTT TTCTAGTCAA AGTCTTGTCG CTCATGACGA GGGACTTTAC GCGAGAAGAG CAAGATTGAT TGAGTCCTCT GCTAATTGGT TTTCCCCCCC ATTCATATCC CCTCACCATA AGACATTAGG AAGTTATTGG TTTATAGCAT TATCTATAAG GTTGTTTGGA AACAGTGAAC TAGCTCTAAG GCTTCCAAGT ATTCTTTCTT CTTTCCTTTG CTTAATAACC TCATACTTAA TAGCATTAAA AACAACAAAT AGTAAGTCTG CATTGATTTC TTTGTTCTCA CTTTCATCAA TGCCATTATG GATTCAATAT TCAAGATATG CGAGTCCAGA CTTACCATTT GTACTTTGTA CTCTATTAGC TATATTATTT TTCCTTAAAT CCCTAGATTC TTCAAAATAT ATAAGTCAGT ATTTTAATTT ATTTTGCTCT GGCCTTTTTA TTTCTTCTTC TTTTTTTATT AGAAGTTATA TGGCCTTTGT TCCTTTTATA GGACTAGCAC CTTTTATTTA TTATCATTTA TTTAGAAAAG AATTTATTTT TAAACTTTTC TTTTGTACTG GAATAGCTGT TGGATTTATA CCTACGTTTT TTAATCTGTA TTTTTCAGTA CAAAAGTTTG GCATCTCCGG AATTACAAGC CTTTTTGATT TTGCAAAGAA ACAGGCTCTC GGTGAATTTG CTTTTAATAA TTTATTACTC GTACCTGTTA ATTTTCTTTA TTTAACATTT CCTATTGGAA TATTACTTCT TATTCTTTTT TTATTCACTG GATCCAATAA TAAGGCTAAC TATCCATTAT TGATTTATTG TTATCCTCTT TCATCTTTAA TCTTACTATT ATGCATGTCT ACATCATATC CACATTATTA TCTTTTTCTC TTACCTTCTT TATCTATAAT CTTTGCAAAC TACCTAACAT CTAATTCACC TAGATATTCA TTTTCAAGCT CTATTATTAG ATATTTAGTC TTTATTGTAC TTTTAATAAT ATCATCTGTT ATATTATTTT CAATTCTTAG ATTTTCAGAC CAAGTACTTC TTTATTCAAG AGGAACTCCC GTAATAGTAT ATATTCTTAT CTCATTAATT GTGTTATCCT ATATTACTTC ACTTAGATTT CTATTTGATT TTAAATATCC AAATTCTAAT ATAATTAATT TCTTTTATAA TATAATCATA CCGCAATATA TTTCTTTATC ATTATTGTTT AATTTTGGTG TCCTAGGTAA TCCCAATCAT AATACGAAAG TTTTCCTGAA GGATGCTGAT GTTTCATCAA TTATAAACTC TAATACTATT TACCTATTTA GTGTTGAAAG TAAGATTCAG ACTTTACTAT CTTATTATTT ACCTTCCTCT GTGATAGTAG ATGATTTTGA ATTAATTAGT AAATATAAGT ATGTAATTAC TTCTAATGCT AATTCATTAG AGAAATTAAA ATTAAAACAA ATCTTTGTTC CTGTTAAAAA ATTTGACAAT CACTTATTAT TAATGAATAT TGACTCATAG
|
Protein sequence | MQIPLSNVIN YMKIKAVVLN FFLKNKKNIL PFFAVLLFIS LYFLIDFSSQ SLVAHDEGLY ARRARLIESS ANWFSPPFIS PHHKTLGSYW FIALSIRLFG NSELALRLPS ILSSFLCLIT SYLIALKTTN SKSALISLFS LSSMPLWIQY SRYASPDLPF VLCTLLAILF FLKSLDSSKY ISQYFNLFCS GLFISSSFFI RSYMAFVPFI GLAPFIYYHL FRKEFIFKLF FCTGIAVGFI PTFFNLYFSV QKFGISGITS LFDFAKKQAL GEFAFNNLLL VPVNFLYLTF PIGILLLILF LFTGSNNKAN YPLLIYCYPL SSLILLLCMS TSYPHYYLFL LPSLSIIFAN YLTSNSPRYS FSSSIIRYLV FIVLLIISSV ILFSILRFSD QVLLYSRGTP VIVYILISLI VLSYITSLRF LFDFKYPNSN IINFFYNIII PQYISLSLLF NFGVLGNPNH NTKVFLKDAD VSSIINSNTI YLFSVESKIQ TLLSYYLPSS VIVDDFELIS KYKYVITSNA NSLEKLKLKQ IFVPVKKFDN HLLLMNIDS
|
| |