Gene Information |
Locus tag | NATL1_07091 |
Symbol | |
ID | 4780303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 651651 |
End bp | 652661 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640083983 |
Product | FAD linked oxidase, N-terminal |
Protein accession | YP_001014532 |
Protein GI | 124025416 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
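The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the record does not state either convention, but both are consistent with its values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues in a complete CDS: number of codons minus one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(651651, 652661)
print(length, protein_length_aa(length))  # 1011 336
```

Both results agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields.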
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.105041 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.609294 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAAT TATGCTCTTT TCTACAAAAG CATAAAAGAA GTTTTCCAAT TGGTTTGTCT GGAAATACAG GAATGGGATA CATCTTGACT GGTGGAATAA GTCCACTCAG TAGGAGCAAA GGATTAGCAA TTGATCAAAT CATAGAGATT AAGGGTTTTT GGGGAAACGG AGAAGAGTTT CATTTACTAC GACCAAATAC GAAAAATGAG TTGACTAATG AATGGAAAGC CCTTTGCGGA GCAGCTATAT TTCTTGGCAT AATCACACAG GTAAAGTTAA AGACTCAGCC ATTAAGACCA CTATTAAGCT GGACAGCAAA CCTTTCATTT TCTCAGCTAT CTGAATGTAT TAATCAAGCC GAAAGTTGGC CTAATTCTCT TAGCCTTCAA TGGATATATG GAGACGATAT TTTTGCTCAT GCAATTGGTG AAATTGAAAA TAATGATGAT GAATCCGTCT TGATTAAATT ATTAGAAAAA TTACCATTCT CTCGAAATAG AATCATTAAT AAATTCAATA ATATGAAGTC TTTACCTAAT TTAAGCCTTG GAGATAATAA TAATTATAAT CATTCAAATC ATTCTGAAGT GCTTGGGTTA TTAGGCCCTG CTTGGCAAGA AAAAAATCAA CAAGTATTAA AAATCCTTAA AGAATTAATA AATAAAAGGC CGAACAAAAG TTGTTATATA GCTTCTCAAC AATTAGGAGG TTTAACACAT TTAAATGATC TTGACACTTC TTTTATTCAT CGTGATGCAA TCTGGAAACC TTGGATTAAC GGTGCTTGGG AAGCCCACAA TCAAGCCGAG AGAAAAAGAA CTCTGGAATG GATGACAGAG TGTTGGAATA ATCTAGAATT CATATGCCCT GGGGTTCATC TTGCGCAAAT ACACCCACAT TTAGAATGGC ATAAAAAAGA ATTATCATCT GCATTCAAAG ATTGGCTCCC AAACTTAGAA GAGCTCAAAG CCATTCATGA TCCAGAAAAT ATAATGCCAC CATTAAAATA G
|
Protein sequence | MGKLCSFLQK HKRSFPIGLS GNTGMGYILT GGISPLSRSK GLAIDQIIEI KGFWGNGEEF HLLRPNTKNE LTNEWKALCG AAIFLGIITQ VKLKTQPLRP LLSWTANLSF SQLSECINQA ESWPNSLSLQ WIYGDDIFAH AIGEIENNDD ESVLIKLLEK LPFSRNRIIN KFNNMKSLPN LSLGDNNNYN HSNHSEVLGL LGPAWQEKNQ QVLKILKELI NKRPNKSCYI ASQQLGGLTH LNDLDTSFIH RDAIWKPWIN GAWEAHNQAE RKRTLEWMTE CWNNLEFICP GVHLAQIHPH LEWHKKELSS AFKDWLPNLE ELKAIHDPEN IMPPLK
|
| |
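The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the Gene sequence field could be checked against the Protein sequence and GC content fields. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the Strand field is "-"), and it uses the standard amino acid assignments, which translation table 11 shares with the standard code; table 11 differs in its permitted alternative start codons, which this sketch does not handle. All function names are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) and their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field.
prefix = "ATGGGAAAAT TATGCTCTTT TCTACAAAAG"
print(translate(prefix))  # MGKLCSFLQK
```

The translated prefix matches the first block of the Protein sequence field. Passing the full Gene sequence value to `gc_fraction` and `translate` would allow the same comparison for the whole record.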