Gene Sare_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1074 
Symbol 
ID5704342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1203046 
End bp1206114 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content71% 
IMG OID641270589 
ProductFAD linked oxidase domain-containing protein 
Protein accessionYP_001535973 
Protein GI159036720 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase
[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.899734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000498092 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCTGCCCC AGCCCGTCGT GCGCGCACCC GGGCCGGCAC CCGAGGTCGA CCTGGCCGCG 
CTCGTCGCCG ACCTGCGGGC CGAGGTGGAT GGAGAGGTCC GGTTCGACGT CGGTTCGCGG
GCCGCCTACT CCACCGACGC CTCCAACTAC CGGCAGGTGC CACTCGGAGT GGTGGTGCCC
CGTACGGTCG AGGCGGGCGT GGCGGCGGTC GCGGTGTGCC GCCGGCACGG CGCTCCGCTG
GTTTCCCGGG GCGGCGGCAC CAGCCTGGCT GGACAATGCA CCAACACCGC TGTCGTGCTG
GACTGGTCGA AGTACTGCCA CCTCCTGCTG GAGGTCGATC CGCAGGCGCG GACCTGCCTG
GTGGAACCCG GCATCGTGTT GGACTCACTC AACGCCCAAC TCGCCTCGAC CGGGCTGGAG
TACGGTCCCC GCCCGGCCAC CCACAGTCGC TGCACCCTGG GTGGCATGCT CGGCAACAAC
TCCTGCGGAG CCACCGCACA GCGCACCGGG AAGGTTGTCG ACAACGTCGT CGAACTGGAG
GTCCTGCTCT ACGACGGCAC CAGGTTCTGG GTGGGCGAGA CCAGCGACGA GCAGTACGCC
GAGATCCAGC GCCGCGGCGG GCGGCGGGCG GAGGTCTACC GTCAGCTGCG GGCACTACGC
GAGGAGTACC TGGCCGACAT CCGTACCCGC TACCCGGACA TTCCTCGCCG GGTGTCCGGG
TACAACCTCG ACAGTCTGCT GCCGGAGAAG GGCTTCCACA TCGCGCAGAC CCTGGTCGGC
TCCGAGGGCA CGCTGGTCAC CGTTCTCCGG GCACGGCTGA GGCTGGTGCC GGTGGTGCGG
GCGTCCGCCC TGGTCTTCGT CAACTACCCC GACATCGCGG CTGCGGGCGA CGACGTCATG
CGGGTGCTGG CACACCAACC GGTCACCCTG GAGGGGATCG ACCACCGGCT CGTCGCCGAC
GAACGGCGTA AGCACCAGCA ACTGGCAGGG CTCCGGGAGA TCCCCGAGGG CGGCGCCTGG
CTGATGATCC AGATGGGTGG CGACACGCCA GCACAGGCCC GCGCCGCCGC CAACCGGTTG
ATCACCGCCG TACGCGGCGG CGGTTCCGGG ACCGTGCACG AGTTCACCGA TCCTGCCCAT
GAACGGCAGA TGTGGCAGGT CCGGGAGTCA TCCCTCGGGG CCACCGCGCA GGTGCGAGGT
GCCGACCGTA CGTGGCCGGG CTGGGAGGAT TCGGCCGTTG CCCCGGAGAA GCTTGGCAGC
TACCTGCGGG ACCTGCGACG GCTCTTGAAC GAGTACGGCC TTGGGCAGGC GTCGTTGTAC
GGCCACTTCG GTCAGGGATG CGTGCACACG CGTATTCCGT TCCAGCTGAC CACCGCCGAC
GGGGTGGCGC GGTTCCGGTC CTTCCTCGAG CGTGCCGCCG ACCTGGTCGT CTCCTACGGT
GGATCCCTCT CCGGTGAGCA CGGGGACGGC CAGGCCCGGG GTGAACTGCT GCCGAAGATG
TACGGCAGCC GGCTGATGCG CGCGTTCGGC CAGCTCAAGG CGATCTTCGA TCCGGCTGAC
CGGATGAATC CGGGTAAGAC GGTGTCGCCC TACCCGCTCG ACAGCCACCT GCGGTTGGGG
GCCGACTACC ACCATCCTTC GCTGCGAACC ACGTTCGCCT ACCCCGACGA CCAGGGCAGT
TTCGCCAACG CCGTACTGCG CTGCGTCGGG GTGGGCAAGT GCCGCCGCCA CGACGGTGGG
GTGATGTGCC CGTCCTACAT GGTCACTCGT GAGGAGGAGG ACTCCACGCG GGGCCGTTCC
CGGCTGCTGT TCGAGATGCT CGACGGCAGC GTCCGGGGCG GCAGCATCGA CGACGGCTGG
CGCTCCGACG CGGTGCGCGA CGCCCTCGAC CTCTGCCTGG CCTGCAAGGG GTGCAAGGCG
GACTGTCCGG TGAACGTGGA CATGGCGACC TACAAGGCGG AGTTCCTGTC CCACCATTAC
GCGGGCCGGT TACGTCCCCG CGCCCACTAC TCGATGGGGT GGCTGCCGGT GCTGGCGGCG
GTGGCCGGGG TCGCGCCGGG CGCGGTGAAC GCCCTCACAC AGGCGCCCGG CCTGGGCCGG
CTCGCCAAGT TCGTCGGCGG TATCGACCAG CGCCGGGACG TACCGACCTT CGCCGGGGAG
AGCTTCCAGC GGTGGTTCGC CGACCGGACC CCGGCTGGGG ACGGCCACCG CGGCGAGGTG
CTGCTCTGGC CGGACACCTT CACCAACCGA TTCCATCCCG GTGTGGCCCA GGCAGCGGTC
GAGGTGCTGG AGGCCGCCGG ATGGCGGGTT CGGGTGCCGG ACCGGCCGGT CTGCTGCGGG
CTGACCTGGG TCTCCACCGG CCAACTCGGC GTCGCCACGT GGATGCTGCG GCGGACCCTG
AACGTCCTTC GGCCGCACCT GCGGGCCGGT ACCCGGGTGG TCGGTCTGGA ACCGAGCTGT
ACGGCCGTGT TCCGCAGTGA CGCCCACGAG CTGTTCCCGG ATGACGAGGA CGTCACCCGC
CTCCGCCAGC AGACGGTCAC CCTGGCCGAG CTGCTCCATG ACCACAGCCC TGGCTGGCGG
CCACCGCGGC TACCGGCGCA CGCGCTGATC CAGACCCACT GCCACCAGCA CGCCGTCCTG
GGTACCACCG CCGACCAGGC AGTGCTCACC GGCGCTGGGG TGGAAGCCGA CTTCGTCGAC
TCGGGCTGCT GCGGGTTGGC CGGCAACTTC GGCTTCGAGC AGGGGCACTA CGAGGTCTCC
GAGGCATGTG CCGAGCGGGT GCTGCTGCCA GCCGTTCGGG ACGCCGCCGG CACCGACGTG
ATTCTCGCCG ACGGGTTCAG CTGTCGAACC CAGGTGGAGC AGAGCGCGGC TGGCGGACGA
TCGGCGCTGC ACCTGGCCGA GTTCCTGCGA GCCGGGTTGC ACGGCGAGGC GGTGACGCCC
TGGCCGGAAC GTCGGTGGGG GCGCCGTCCG CAGCCGCCTA CCCGGGCGGC CCGGCTGGCC
GCGGTCGGGC TGCTCGGCTT GGCCGTCCTC GCGCCGGTGG TCGCCCTCGT CGCGTCGAAG
GCTCGGTGA
 
Protein sequence
MLPQPVVRAP GPAPEVDLAA LVADLRAEVD GEVRFDVGSR AAYSTDASNY RQVPLGVVVP 
RTVEAGVAAV AVCRRHGAPL VSRGGGTSLA GQCTNTAVVL DWSKYCHLLL EVDPQARTCL
VEPGIVLDSL NAQLASTGLE YGPRPATHSR CTLGGMLGNN SCGATAQRTG KVVDNVVELE
VLLYDGTRFW VGETSDEQYA EIQRRGGRRA EVYRQLRALR EEYLADIRTR YPDIPRRVSG
YNLDSLLPEK GFHIAQTLVG SEGTLVTVLR ARLRLVPVVR ASALVFVNYP DIAAAGDDVM
RVLAHQPVTL EGIDHRLVAD ERRKHQQLAG LREIPEGGAW LMIQMGGDTP AQARAAANRL
ITAVRGGGSG TVHEFTDPAH ERQMWQVRES SLGATAQVRG ADRTWPGWED SAVAPEKLGS
YLRDLRRLLN EYGLGQASLY GHFGQGCVHT RIPFQLTTAD GVARFRSFLE RAADLVVSYG
GSLSGEHGDG QARGELLPKM YGSRLMRAFG QLKAIFDPAD RMNPGKTVSP YPLDSHLRLG
ADYHHPSLRT TFAYPDDQGS FANAVLRCVG VGKCRRHDGG VMCPSYMVTR EEEDSTRGRS
RLLFEMLDGS VRGGSIDDGW RSDAVRDALD LCLACKGCKA DCPVNVDMAT YKAEFLSHHY
AGRLRPRAHY SMGWLPVLAA VAGVAPGAVN ALTQAPGLGR LAKFVGGIDQ RRDVPTFAGE
SFQRWFADRT PAGDGHRGEV LLWPDTFTNR FHPGVAQAAV EVLEAAGWRV RVPDRPVCCG
LTWVSTGQLG VATWMLRRTL NVLRPHLRAG TRVVGLEPSC TAVFRSDAHE LFPDDEDVTR
LRQQTVTLAE LLHDHSPGWR PPRLPAHALI QTHCHQHAVL GTTADQAVLT GAGVEADFVD
SGCCGLAGNF GFEQGHYEVS EACAERVLLP AVRDAAGTDV ILADGFSCRT QVEQSAAGGR
SALHLAEFLR AGLHGEAVTP WPERRWGRRP QPPTRAARLA AVGLLGLAVL APVVALVASK
AR