Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_08721 |
Symbol | |
ID | 4776931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 793048 |
End bp | 794598 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640086381 |
Product | hypothetical protein |
Protein accession | YP_001016888 |
Protein GI | 124022581 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGTCTCGA CAACTGCCAC CATTAAGCGA CAGTGGATAT GGATATTTGC CCTCTTGTCG CTCCTGATTG GATCATTCTG TATCAATTAC TGGGGCAACG ACTTTCCGTT GTGGTTGCAT CCGGATGAAA TCAAAAAAGT GATTTACATT AGTGGCGATA AAAAAACAGA TTTCTATCAC CCTCAATTAA TCCTTAAACT TGGCCGACTC TTTGCTGATA TAGCTTCTGC ATCCTCACAA CAAGATGTAG TTCAAGCCGG GCGCACGTCC ACGGCTTTTC TTGGAAGCTT TGTGGTGCTG TTCACTTTTG GTTTAGCTAG GCAGTTCTTA TCAGCACCAG CATCTTTGCT TGCCGCCAGC CTGACGGCGG CATCTCCAGG AATTGTCATT CATAGCCATT ACTTGAAAGA GGATATTGCT TTGACTTGTT TTGCAACAGC CTCTTTGCTG CTTGCAATAG TTACAGCAAA AGAATCAGTT GGAGATATCT CTTCCAAAAC TATCGATTAC AAAATCCCAG TTCTTTGGGG TCTTCTTACA GGTCTAGCGA TCGCCAGTAA ATACACAGCT TTTTTATTAT TTCCTTTTTA TTTCTTTCTA CCATGGCTCA TCAGGAAAGT TCATCGAAAA ACTTTTCATA AAAAACTGAT AATAAGCGCA TCAATCGCTG CGATTATTTT TTTGCTGATT AACAATCATC TATTTGCAGA CTTCAAAACC TTTACTTCGG GAGCAGAATT TGAGCTGAAT CATGTCTTAA GTGGTCACAA AGGGATAAAG ATTTACCCCT GGCATAATTT GATTCATCTC ACCAAAAATC TCCCCGATTC AATCGGACCT TTAACTTTAA TTCTGGGTTT GTTTGGTATG AGCGGAGAGA TTTGGAGGTC GAGAAGAGCA AATATCTCCC GTCGGATAAC TTTTTATTTC ACAATCTATT TTTATATTAT TTTGGAATGT TTACCACTGA AACCGCCACC TGGAGATGCA CGTTATGTCC TTCCACTGAT TCCTCCGCTC ACCTGCTTTG CTGCATCTGC AGTTAGTGAG TTGAGTAATT GGTTGAAATC TAGAATTAAT TTTTCCTTGT CAATAGCAAT AGCTTTAAGT GTGGCAACAT TAATACCATC GATTTCTAGG AGCTTAGAGC TAAACCAATA TCTAAGGGAT GATATAAGAA CTCGGATTCC AGCAGAAATC GGAACATATT TAAAGCCCAT ATGGATGGAT GGTTATGCGG GCTACCATTT TCTTGAGGAT TCAGTTCAAG CGGAGATTTT AACAAAATCA GATAATAAAT TTGATACAAA TATTTCGCTT GAATCCCCCC AGGAAAAGAT CTGTACGTTC ATTACTAGTA GTTTCATTTA CGAAAGGTAT TTGAGGGCAT CAAGGTTTTC TGGCCAAGGG GAAAACGTAT ATGAAATGAG TCACTTTTAT CAACGTCTTT TTGACCGGCC ATTTAAGGAC CTTAATCCTA ATGAACCGTC GTATTCGTTT GTCAATCCCA CTCTCCGAAT CGTCAACCTC TGCCAATCAA AAACAGAGTG A
|
Protein sequence | MVSTTATIKR QWIWIFALLS LLIGSFCINY WGNDFPLWLH PDEIKKVIYI SGDKKTDFYH PQLILKLGRL FADIASASSQ QDVVQAGRTS TAFLGSFVVL FTFGLARQFL SAPASLLAAS LTAASPGIVI HSHYLKEDIA LTCFATASLL LAIVTAKESV GDISSKTIDY KIPVLWGLLT GLAIASKYTA FLLFPFYFFL PWLIRKVHRK TFHKKLIISA SIAAIIFLLI NNHLFADFKT FTSGAEFELN HVLSGHKGIK IYPWHNLIHL TKNLPDSIGP LTLILGLFGM SGEIWRSRRA NISRRITFYF TIYFYIILEC LPLKPPPGDA RYVLPLIPPL TCFAASAVSE LSNWLKSRIN FSLSIAIALS VATLIPSISR SLELNQYLRD DIRTRIPAEI GTYLKPIWMD GYAGYHFLED SVQAEILTKS DNKFDTNISL ESPQEKICTF ITSSFIYERY LRASRFSGQG ENVYEMSHFY QRLFDRPFKD LNPNEPSYSF VNPTLRIVNL CQSKTE
|
| |