Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_14121 |
Symbol | |
ID | 4911916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1179126 |
End bp | 1179884 |
Gene Length | 759 bp |
Protein Length | 252 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640161003 |
Product | flagellin modification protein A |
Protein accession | YP_001091636 |
Protein GI | 126696750 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.200573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGATT TTTTTTCATT AAGTGGCAAA ACTTTTTTAG TTACTGGAAG CGAAGGTTTA ATTGGCTTGG ATTTATGCAA AAGACTATTA GAGGCTGGTG CTGATGTAAT TTCTTGTGAT TTAAAAAGCC CAACAAATAC TTCTAATTCA GATAAAAATA GTTTTTTTGA ATGTGATTTA ACGACGAATA AAGGTTTAAG AGAGTTCAAG GATTTTATCA GAAATTGTGG GGTAAAAATT AATGGTTTCG TGCACTGTGC TTATCCTAGA CCAGAAAGTT GGGGTAAATT ATTTGATGAG ATATCTCTTG ATGAAGTCAG TTTGCATTTA AATATGCAAT TAAGCTCTTC TATAATTTTA TCAAGAATAA TTTGCAACTA TTTAAAAATA AATGGAGGGG GCAGTCTTGT AAATATTGCA TCAATACAAG GTATAGCTGC ACCTAAGTTT CATCACTATC GTGGAACAAG CATGAGTTCT CCCATAGAAT ATTCTTGTGT TAAATCTTCA ATAATAATAA TGACTAAATG GCTTGCTAAA TATTTTAAAA ATTCTAATTT AAATATTAAC TGTGTTAGCC CAGGAGGAGT ATTGGATAAA CAGTTAGAAA TTTTTCAGAA TAACTATAAA TCAGATTGCA ATAATATTGG CCTACTTAAA CCAAAAAATG TTTCATATCC TATTTTGTTT CTTCTCTCAG ATCAAGCAAA AGGTATATCT GGTCAGAATT TAATAGTTGA TGATGGGTGG TCTTTATGA
|
Protein sequence | MEDFFSLSGK TFLVTGSEGL IGLDLCKRLL EAGADVISCD LKSPTNTSNS DKNSFFECDL TTNKGLREFK DFIRNCGVKI NGFVHCAYPR PESWGKLFDE ISLDEVSLHL NMQLSSSIIL SRIICNYLKI NGGGSLVNIA SIQGIAAPKF HHYRGTSMSS PIEYSCVKSS IIIMTKWLAK YFKNSNLNIN CVSPGGVLDK QLEIFQNNYK SDCNNIGLLK PKNVSYPILF LLSDQAKGIS GQNLIVDDGW SL
|
| |