Gene Information |
Locus tag | P9303_01141 |
Symbol | |
ID | 4777679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 114258 |
End bp | 115040 |
Gene Length | 783 bp |
Protein Length | 260 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640085613 |
Product | flagellin modification protein A |
Protein accession | YP_001016134 |
Protein GI | 124021827 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
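The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the database record: the variable names are illustrative, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the record above; the variable names are illustrative.
start_bp = 114258
end_bp = 115040

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (783 bp) and Protein Length (260 aa).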
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.667394 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAGT CTCTAAGTCT GAAGGATGAA TGTATTTTAA TAACAGGAGC GGCAGGAAGA ATAGGTTCAG CTGTCGCTAA AATTGCTTTT CACGCGGGAG CAAATATAAT ACTAACCGAT ATAAATTATA AGGCTATGAA TGATAGTGCA AAAGAATTGA TAAAAATAGA TGGCAGCAGG GTCCATTGTC TTGAGTGCGA TATTACAGTA GCAGAGGATA TTGAAAAAAT GATAAGCTCG TGCTTATCTG TTCATAGTGT TGTTAATGGG GCAGTGCACA CTGCATATCC AACGTCTAGA GGATGGGGAG CAAGATTTGA AGACATTAAA CCTGAAGATT TATATTTAGA CCTCAATATG CAGCTAGGGG GATCAATACT ATTTTCACAG AAAATATTAA AATGCTTTGA AAGGCAAAAG AAGGGTAGCT TAATTCTTGT ATCTTCTATT CAAGGCATTA ATGCGCCAAA GTTTGAGCAT TATGAAGGCA CTTCTATGTA TTCTCCAATA GAATATGCCG CGATTAAAGC TGGTATTATT TCCATCACCA GGTGGCTTGC AAAATATTAT TCGGATAAAG GTATTCGTGT GAATTGTGTC AGTCCTGGCG GAATAAAAGA TAATCAGCCT GTTGCATTCC AACAGAAATA TAAAAAGAGT TGTACTAATA TCGGGTTGTT AGAGAGTGAA GATGTAGCTC ATACTATAAT ATTCTTATTA TCCAGTGCAG CATTTGCAAT CAATGGCCAG AATATTATTA TTGACGATGG ATGGGTACTT TAA
Protein sequence | MDKSLSLKDE CILITGAAGR IGSAVAKIAF HAGANIILTD INYKAMNDSA KELIKIDGSR VHCLECDITV AEDIEKMISS CLSVHSVVNG AVHTAYPTSR GWGARFEDIK PEDLYLDLNM QLGGSILFSQ KILKCFERQK KGSLILVSSI QGINAPKFEH YEGTSMYSPI EYAAIKAGII SITRWLAKYY SDKGIRVNCV SPGGIKDNQP VAFQQKYKKS CTNIGLLESE DVAHTIIFLL SSAAFAINGQ NIIIDDGWVL
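The relationship between the two sequences can be illustrated by translating the start of the gene sequence. This is a minimal sketch under stated assumptions: the helper `translate` and the constant names are hypothetical, the codon table is built by hand from the amino acid assignments of NCBI translation table 11 (which match the standard table for elongation), and the listed gene sequence is treated as already being in coding orientation, since it begins with ATG and ends with TAA.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, in TCAG base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
first_30 = "ATGGATAAGT CTCTAAGTCT GAAGGATGAA"
print(translate(first_30))  # MDKSLSLKDE
```

The result matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same helper would be one way to check the whole record.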