Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1074 |
Symbol | |
ID | 5704342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1203046 |
End bp | 1206114 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641270589 |
Product | FAD linked oxidase domain-containing protein |
Protein accession | YP_001535973 |
Protein GI | 159036720 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.899734 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000498092 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCTGCCCC AGCCCGTCGT GCGCGCACCC GGGCCGGCAC CCGAGGTCGA CCTGGCCGCG CTCGTCGCCG ACCTGCGGGC CGAGGTGGAT GGAGAGGTCC GGTTCGACGT CGGTTCGCGG GCCGCCTACT CCACCGACGC CTCCAACTAC CGGCAGGTGC CACTCGGAGT GGTGGTGCCC CGTACGGTCG AGGCGGGCGT GGCGGCGGTC GCGGTGTGCC GCCGGCACGG CGCTCCGCTG GTTTCCCGGG GCGGCGGCAC CAGCCTGGCT GGACAATGCA CCAACACCGC TGTCGTGCTG GACTGGTCGA AGTACTGCCA CCTCCTGCTG GAGGTCGATC CGCAGGCGCG GACCTGCCTG GTGGAACCCG GCATCGTGTT GGACTCACTC AACGCCCAAC TCGCCTCGAC CGGGCTGGAG TACGGTCCCC GCCCGGCCAC CCACAGTCGC TGCACCCTGG GTGGCATGCT CGGCAACAAC TCCTGCGGAG CCACCGCACA GCGCACCGGG AAGGTTGTCG ACAACGTCGT CGAACTGGAG GTCCTGCTCT ACGACGGCAC CAGGTTCTGG GTGGGCGAGA CCAGCGACGA GCAGTACGCC GAGATCCAGC GCCGCGGCGG GCGGCGGGCG GAGGTCTACC GTCAGCTGCG GGCACTACGC GAGGAGTACC TGGCCGACAT CCGTACCCGC TACCCGGACA TTCCTCGCCG GGTGTCCGGG TACAACCTCG ACAGTCTGCT GCCGGAGAAG GGCTTCCACA TCGCGCAGAC CCTGGTCGGC TCCGAGGGCA CGCTGGTCAC CGTTCTCCGG GCACGGCTGA GGCTGGTGCC GGTGGTGCGG GCGTCCGCCC TGGTCTTCGT CAACTACCCC GACATCGCGG CTGCGGGCGA CGACGTCATG CGGGTGCTGG CACACCAACC GGTCACCCTG GAGGGGATCG ACCACCGGCT CGTCGCCGAC GAACGGCGTA AGCACCAGCA ACTGGCAGGG CTCCGGGAGA TCCCCGAGGG CGGCGCCTGG CTGATGATCC AGATGGGTGG CGACACGCCA GCACAGGCCC GCGCCGCCGC CAACCGGTTG ATCACCGCCG TACGCGGCGG CGGTTCCGGG ACCGTGCACG AGTTCACCGA TCCTGCCCAT GAACGGCAGA TGTGGCAGGT CCGGGAGTCA TCCCTCGGGG CCACCGCGCA GGTGCGAGGT GCCGACCGTA CGTGGCCGGG CTGGGAGGAT TCGGCCGTTG CCCCGGAGAA GCTTGGCAGC TACCTGCGGG ACCTGCGACG GCTCTTGAAC GAGTACGGCC TTGGGCAGGC GTCGTTGTAC GGCCACTTCG GTCAGGGATG CGTGCACACG CGTATTCCGT TCCAGCTGAC CACCGCCGAC GGGGTGGCGC GGTTCCGGTC CTTCCTCGAG CGTGCCGCCG ACCTGGTCGT CTCCTACGGT GGATCCCTCT CCGGTGAGCA CGGGGACGGC CAGGCCCGGG GTGAACTGCT GCCGAAGATG TACGGCAGCC GGCTGATGCG CGCGTTCGGC CAGCTCAAGG CGATCTTCGA TCCGGCTGAC CGGATGAATC CGGGTAAGAC GGTGTCGCCC TACCCGCTCG ACAGCCACCT GCGGTTGGGG GCCGACTACC ACCATCCTTC GCTGCGAACC ACGTTCGCCT ACCCCGACGA CCAGGGCAGT TTCGCCAACG CCGTACTGCG CTGCGTCGGG GTGGGCAAGT GCCGCCGCCA CGACGGTGGG GTGATGTGCC CGTCCTACAT GGTCACTCGT GAGGAGGAGG ACTCCACGCG GGGCCGTTCC CGGCTGCTGT TCGAGATGCT CGACGGCAGC GTCCGGGGCG GCAGCATCGA CGACGGCTGG CGCTCCGACG CGGTGCGCGA CGCCCTCGAC CTCTGCCTGG CCTGCAAGGG GTGCAAGGCG GACTGTCCGG TGAACGTGGA CATGGCGACC TACAAGGCGG AGTTCCTGTC CCACCATTAC GCGGGCCGGT TACGTCCCCG CGCCCACTAC TCGATGGGGT GGCTGCCGGT GCTGGCGGCG GTGGCCGGGG TCGCGCCGGG CGCGGTGAAC GCCCTCACAC AGGCGCCCGG CCTGGGCCGG CTCGCCAAGT TCGTCGGCGG TATCGACCAG CGCCGGGACG TACCGACCTT CGCCGGGGAG AGCTTCCAGC GGTGGTTCGC CGACCGGACC CCGGCTGGGG ACGGCCACCG CGGCGAGGTG CTGCTCTGGC CGGACACCTT CACCAACCGA TTCCATCCCG GTGTGGCCCA GGCAGCGGTC GAGGTGCTGG AGGCCGCCGG ATGGCGGGTT CGGGTGCCGG ACCGGCCGGT CTGCTGCGGG CTGACCTGGG TCTCCACCGG CCAACTCGGC GTCGCCACGT GGATGCTGCG GCGGACCCTG AACGTCCTTC GGCCGCACCT GCGGGCCGGT ACCCGGGTGG TCGGTCTGGA ACCGAGCTGT ACGGCCGTGT TCCGCAGTGA CGCCCACGAG CTGTTCCCGG ATGACGAGGA CGTCACCCGC CTCCGCCAGC AGACGGTCAC CCTGGCCGAG CTGCTCCATG ACCACAGCCC TGGCTGGCGG CCACCGCGGC TACCGGCGCA CGCGCTGATC CAGACCCACT GCCACCAGCA CGCCGTCCTG GGTACCACCG CCGACCAGGC AGTGCTCACC GGCGCTGGGG TGGAAGCCGA CTTCGTCGAC TCGGGCTGCT GCGGGTTGGC CGGCAACTTC GGCTTCGAGC AGGGGCACTA CGAGGTCTCC GAGGCATGTG CCGAGCGGGT GCTGCTGCCA GCCGTTCGGG ACGCCGCCGG CACCGACGTG ATTCTCGCCG ACGGGTTCAG CTGTCGAACC CAGGTGGAGC AGAGCGCGGC TGGCGGACGA TCGGCGCTGC ACCTGGCCGA GTTCCTGCGA GCCGGGTTGC ACGGCGAGGC GGTGACGCCC TGGCCGGAAC GTCGGTGGGG GCGCCGTCCG CAGCCGCCTA CCCGGGCGGC CCGGCTGGCC GCGGTCGGGC TGCTCGGCTT GGCCGTCCTC GCGCCGGTGG TCGCCCTCGT CGCGTCGAAG GCTCGGTGA
|
Protein sequence | MLPQPVVRAP GPAPEVDLAA LVADLRAEVD GEVRFDVGSR AAYSTDASNY RQVPLGVVVP RTVEAGVAAV AVCRRHGAPL VSRGGGTSLA GQCTNTAVVL DWSKYCHLLL EVDPQARTCL VEPGIVLDSL NAQLASTGLE YGPRPATHSR CTLGGMLGNN SCGATAQRTG KVVDNVVELE VLLYDGTRFW VGETSDEQYA EIQRRGGRRA EVYRQLRALR EEYLADIRTR YPDIPRRVSG YNLDSLLPEK GFHIAQTLVG SEGTLVTVLR ARLRLVPVVR ASALVFVNYP DIAAAGDDVM RVLAHQPVTL EGIDHRLVAD ERRKHQQLAG LREIPEGGAW LMIQMGGDTP AQARAAANRL ITAVRGGGSG TVHEFTDPAH ERQMWQVRES SLGATAQVRG ADRTWPGWED SAVAPEKLGS YLRDLRRLLN EYGLGQASLY GHFGQGCVHT RIPFQLTTAD GVARFRSFLE RAADLVVSYG GSLSGEHGDG QARGELLPKM YGSRLMRAFG QLKAIFDPAD RMNPGKTVSP YPLDSHLRLG ADYHHPSLRT TFAYPDDQGS FANAVLRCVG VGKCRRHDGG VMCPSYMVTR EEEDSTRGRS RLLFEMLDGS VRGGSIDDGW RSDAVRDALD LCLACKGCKA DCPVNVDMAT YKAEFLSHHY AGRLRPRAHY SMGWLPVLAA VAGVAPGAVN ALTQAPGLGR LAKFVGGIDQ RRDVPTFAGE SFQRWFADRT PAGDGHRGEV LLWPDTFTNR FHPGVAQAAV EVLEAAGWRV RVPDRPVCCG LTWVSTGQLG VATWMLRRTL NVLRPHLRAG TRVVGLEPSC TAVFRSDAHE LFPDDEDVTR LRQQTVTLAE LLHDHSPGWR PPRLPAHALI QTHCHQHAVL GTTADQAVLT GAGVEADFVD SGCCGLAGNF GFEQGHYEVS EACAERVLLP AVRDAAGTDV ILADGFSCRT QVEQSAAGGR SALHLAEFLR AGLHGEAVTP WPERRWGRRP QPPTRAARLA AVGLLGLAVL APVVALVASK AR
|
| |