Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_07141 |
Symbol | |
ID | 4781234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 657132 |
End bp | 659228 |
Gene Length | 2097 bp |
Protein Length | 698 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640083988 |
Product | hypothetical protein |
Protein accession | YP_001014537 |
Protein GI | 124025421 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.340212 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.245946 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTATTA AGAAAAATAT TAATCACTCT AAATATATTT TTTGGATTTC AATTTTATTT ATATGGATTC TTTCGACCAT AATTGACCGT ATCTGGTGGA ATTTATATAG CATTACTCCT TCATGGGATC AGGCTGACTA TCTCAATAGT GCCCTTGACC ATGGCCGTGC ACTCTCTTTT TTGGGAGCAG ATGGAGCTTC AGACTTTAAT TCTTTACTAG ATAAATCGCC AAAAATTCCT CCTTTGGCTT CAATAATCAA TGGAGCCGTA ATTACCTTTG CTGGTGATGC TCCTCATCAG GCGGCCTGGT CCTTAAGTTT TTGGAATGGA TTCTTTATCT TTAATATTGC TTCGTGGGGA CTTTATTTGA GTGGGAAAAA ACTTGGACTT TTTTGTGTTC TTATCAGTGC ATTTTCTCCT TTCTTATTTA ATCTAAGAAC TGATTATGTA TTGGAGTTAC CTTTAATTTC CGCTATTACA TTTTATTTGT TTCATCTAGG AAGGTGGAGT GATAAATCAA TCGGAGGTAA ATGGATTCAA TTGATAATTG CTACTTTCGC ATGCTCTTTT TCTTTATTGA TTAAGCAAAG TTCTTTATTA GTTATCATAC CTTCTTTATT ATTTGTTTTT GTGCTTTCTT TTAAAAGAGA TAAAAAATTT CGATTACAAT TTTTATGCTT AGTTCTTATA AATATTTTAG CAATTTTACC TTGGTTTTTT CACAATTGGA TAATGATATT AAGTGGAACT TATAGAGCTG TTTTTGAATC GGCGGCGATA GAAGGTGATC CTTCTATTTT AGGTTTTAAA AGTATTTTCT GGTATTTTCC ATATTTAGAT AATCAGTTTG GAATTATTAT TTTCGTTTTT GGATTGTCAG GAATACTATT TGCATTTTTA ACCTATTTAA GATCTTTTAG ATCTCAAGCA AGATTAGTTG ATATTTTTAA TGAGAATAAT TATAAATGGA CATGGATTTA TTTTAATTTA ATAACATGCT GGACTTTTAC AACTTTCATT CCTAACAAGG ATGAAAGATA TATAGCATGT ACAATCCCGT TAATTATTTT ACTGCTAGGC TTTGGATTTA CTAAGTGGAG TGATTGGCTA GGTACTTATT CTAAATTAAA CTCTTATATT TTATTATTTA TTCCTGCTGT AAGTTTTCTA TTTTCCAATT CTATTAATAA GTTTAACGCT CTACAAAATA TTACAAGTAA ATATTATCCT GTTAAAGATA TTTTATCGAT AGTTAGATCT GATCAGTCTA TCGATAAAAA AGAAACAGTT ATTGTTGTTC CAAGCACCCC TGAAATTAAT CAGCATAATG TAAGCTATTT TGGAAGAATG CAAGGTGGAA ATATTTTAGG CAGACAACTT GGGCAATCTC TTTTGCATAT AGAACCAGTG CTTAAATACT CTAATTGGAT TATTTTGGCA GACGGAGATC AAGGCTCAGT TCCAAGTAAT TCACTAGTTC TAGACAAAGC GATTAGAGAT AGTTCTCTTT TTATACAAGT TCAAGAATTT CCTAGAGAAC AAGAGGGAAG CTATTCTCTT TGGAAGCGAA GATCAAGTTC ATTTAATCCA AATGAATTTC ATAATAGATT TATTGAACTA GCAAAGGGGA TGGAGAAAGG TCCATTAGGT ATTAAATTGA TTTTTGATGA AATAGAAATA GAACATATGC TTGATGGGCA TTTGAAATAT CAAAGTATAG TTAGAGATAA GGCATTATCC AAAATAAGTT CAGACCCTGA AAATGTTGAA TCTTTATGGT CCTTATCGCT TTTGAAGATA TTATCGAATA GACCTTATGA AGCTGATATT TATTTAAGAA ATTTAGAAAT CTTGTTGCCA AATAATCCTT GGCCAAGTGC TTATAGAATA ATAGTTAACT TTGCCTCTTG GAATCCTTGG AAGGCCTCCT TAATAGCCGA TAAGGCTAAT AAAAGAAATC CAAATTACTT TCTAAAAAGT TTGAGTGATA TAAGTGCAAT TTTCAGGGGA TCCTTTTGGA GAATAAAGTC TGCTTTAAAT AGTGTTCCGA ATGCAATAAA AAGTGTTGAT GAATCTCTAA AACCAATAGA AAAATAG
|
Protein sequence | MLIKKNINHS KYIFWISILF IWILSTIIDR IWWNLYSITP SWDQADYLNS ALDHGRALSF LGADGASDFN SLLDKSPKIP PLASIINGAV ITFAGDAPHQ AAWSLSFWNG FFIFNIASWG LYLSGKKLGL FCVLISAFSP FLFNLRTDYV LELPLISAIT FYLFHLGRWS DKSIGGKWIQ LIIATFACSF SLLIKQSSLL VIIPSLLFVF VLSFKRDKKF RLQFLCLVLI NILAILPWFF HNWIMILSGT YRAVFESAAI EGDPSILGFK SIFWYFPYLD NQFGIIIFVF GLSGILFAFL TYLRSFRSQA RLVDIFNENN YKWTWIYFNL ITCWTFTTFI PNKDERYIAC TIPLIILLLG FGFTKWSDWL GTYSKLNSYI LLFIPAVSFL FSNSINKFNA LQNITSKYYP VKDILSIVRS DQSIDKKETV IVVPSTPEIN QHNVSYFGRM QGGNILGRQL GQSLLHIEPV LKYSNWIILA DGDQGSVPSN SLVLDKAIRD SSLFIQVQEF PREQEGSYSL WKRRSSSFNP NEFHNRFIEL AKGMEKGPLG IKLIFDEIEI EHMLDGHLKY QSIVRDKALS KISSDPENVE SLWSLSLLKI LSNRPYEADI YLRNLEILLP NNPWPSAYRI IVNFASWNPW KASLIADKAN KRNPNYFLKS LSDISAIFRG SFWRIKSALN SVPNAIKSVD ESLKPIEK
|
| |