Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_00441 |
Symbol | |
ID | 4716726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 44316 |
End bp | 46091 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640077741 |
Product | flavoprotein |
Protein accession | YP_001008439 |
Protein GI | 123967581 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0426] Uncharacterized flavoproteins [COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGCCT CTGCCCAGAC AAGTAATTCT AAATTGGCAC CAAATAATAG CAAGTTGACG GTTCAATCTC AAAATTTTGC TGATGATTCT TGTGCCATAA GATCTTTGGA TTGGGATCGT AGTAGATTTG ATATTGAATT TGGTTTAAGA AATGGAACTA CTTACAATAG TTTTATTATT AAAGGCGAGA AATTAGCAAT TATTGATACT AGTCACGCAA AGTTCGAAGA ATTATGGTTT GAAGAATTAC TGAAAAAGGT AAATCCGCAA GAAGTTGATT ATTTAATTAC TAGCCATACA GAACCTGATC ATTCTGGTTT AATAGGTAAT CTTTTAGAAT TAAATCAAAA TATCACAGTA GTTGGATCAA AATTAGCACT TAAATTTATT GAAGACCAAA TACATATTCC CTTTAAACGT CTAGAGGTCA AGAGTGGAGA GTTTTTAAAT CTCGGAACTA ATCCTAATAG TGGTTTACAA CATAATATTG AATTTATAAG TGCACCAAAT TTACATTGGC CAGATACCAT ATTTTCATAT GATCACAGCA CAAATGTTCT CTATACATGC GATGCATTTG GTCTCCATTA TTGTTCTGAC GAATTTTTTG ACACTGATCA AAAAGAAATA TACGATGATT TCCGTTTTTA TTACGATTGC CTTATGGGTC CAAACGCTAG AAGCGTTATG CAGGCAATTA AAAGAATAGA TAAGCTACCT AAAGTAAAAA CAATAGCCGT TGGTCATGGG CCTTTGCTCC ATAATCAGGT CAATTTTTGG AAAGGAAAAT ATCTAGAATG GAGTAGTAAT AAAAGCAAAG GTAATGATTT TGTGTCAGTC TGCTACATAA GCGACTATGG TTATTGTGAT CGACTCAGTC AAGCAATATC TCATGGAATA AGCAAAGCAG ATGCACAGGT TCAATTAATT GATTTAAGAT CTTCTGACCC GCAAGAATTA ACAAGTTTAA TTTCCGAATC AAAAGCAGTA GTCATTCCCA CATGGCCAGT AGACTCAGAT AATGAATTAA AAGAATCTCT CGGTACTTTA TTTGCAGCAC TAAAACCAAA ACAATTTACT GCAGTTTATG ATGCATTTGG TGGAAATGAT GAACCAATAG ATTCCTTAGC AAATAAATTA AGAGAACTTG GTCAAAAAGA AGCTTTCTCT CCATTAAGAG TTAAAAATAT CCCAGATCCC ATTGTTTATC AACAATTCGA AGAAGCTGGA ACTGACTTGG GTCAATTGAT CAATAAAAAG AAGAATATTG CCTCTATGAA GAGCCTTGAT TCAAATTTAG ATAAAGCATT AGGGAGATTA AGTGGAGGAT TATATGTAGT TACAGCGAGC CAAGGCGAAG GTTCTACATT CAGACAAAGT GCAATGGTCG CAAGTTGGGT TAGTCAAGCA AGCTTTTCTC CACCAGGTAT TACAGTTGCA GTAGCAAAAG ATAGAGCTAT TGAATCATAT ATGCAGGTTG GGAAAGGTTT TGTTGTGAAT ATTTTAAGGG AAGATAACTA TCAAAAAATG TTCAGACATT TTTTAAAAAG ATTTGCCCCT GGAGCTGATA GATTCGCAGA TGTAGATGTA ATTAGCAACA TCGCTGAAGG AGGACCAGTT CTTTCAGATT CACTCGCCTT TTTAGATTGT AAAGTTAGTT CCAGGCTAGA GACTCCAGAC CATTGGATAA TTTACGGAAT TGTTGAAAAT GGTAATGTTT CTGACTTATC ATGCAAGACA GCAGTTCATC ACAGAAAAGT TGCTAATCAC TATTAG
|
Protein sequence | MIASAQTSNS KLAPNNSKLT VQSQNFADDS CAIRSLDWDR SRFDIEFGLR NGTTYNSFII KGEKLAIIDT SHAKFEELWF EELLKKVNPQ EVDYLITSHT EPDHSGLIGN LLELNQNITV VGSKLALKFI EDQIHIPFKR LEVKSGEFLN LGTNPNSGLQ HNIEFISAPN LHWPDTIFSY DHSTNVLYTC DAFGLHYCSD EFFDTDQKEI YDDFRFYYDC LMGPNARSVM QAIKRIDKLP KVKTIAVGHG PLLHNQVNFW KGKYLEWSSN KSKGNDFVSV CYISDYGYCD RLSQAISHGI SKADAQVQLI DLRSSDPQEL TSLISESKAV VIPTWPVDSD NELKESLGTL FAALKPKQFT AVYDAFGGND EPIDSLANKL RELGQKEAFS PLRVKNIPDP IVYQQFEEAG TDLGQLINKK KNIASMKSLD SNLDKALGRL SGGLYVVTAS QGEGSTFRQS AMVASWVSQA SFSPPGITVA VAKDRAIESY MQVGKGFVVN ILREDNYQKM FRHFLKRFAP GADRFADVDV ISNIAEGGPV LSDSLAFLDC KVSSRLETPD HWIIYGIVEN GNVSDLSCKT AVHHRKVANH Y
|
| |