Gene Sare_2941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2941 
Symbol 
ID5707819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3332718 
End bp3334595 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content63% 
IMG OID641272390 
Productradical SAM domain-containing protein 
Protein accessionYP_001537758 
Protein GI159038505 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.151615 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCTCG AATCGATGCC CCTGGCGGCG GGGTATCTCA AAGCCATGGC CACGGCCGAC 
GACCGCGTAA AACGGGCTAT GTCCATCGAG ATCAGGAACT TTCGTGGCGG AGTGAACAAC
GCCTACATGG CCAACGCGAT GTTCACTCAA GGGGCACCGG ACATCATTGC CTTCTCCGTC
CTGGGCTGGA ACTACGACTC GTTCGGCGCC CTCGCCCAGA CCTTCAAACA GATCAACCCA
GATGGTTGGG TGATCTTCGG GGGCAACCAC GTCAGCAACC AGGCCGAGCG CGTCTTCCGG
ATGTACCCGG CGGTCGACGT CGTTATCAAC GGCGAGGGCG AGGTGACCTT CCGGGAACTA
CTTCGGGCCT ACCTGGATGG CGTGAACAAA CGCGATCTTG CCGCTGTCGC GGGTTTGTCC
TACCGCAATG CCGCGGCGGA GACCGTCACC ACGGCTGTAC GCCCTCGCAT CGACGACCTG
GACTCCATCC CCTCGCCCTT CCTCAACGGT GCCCTGGATC TCACCGACGC CGCCGGGAGG
TTCCGCTACG ACGTCGCCCT GATGGAAACC AACCGTGGCT GCCCCTACAA GTGCGCGTTC
TGCTATTGGG GCGGCGCTAC CGGACAGAAG GTCCGCGCGT TCTCCATCGA GCGGCTACGT
GCGGAGCTCG AACTCTTCGG CAAGCTGGGG GTCCACACGA TCGTGGCCTG CGACGCGAAC
TTCGGTATCC TGCCGGGTGA CTATGACTTC GTCAAGACGA TGATCCAGAC CCGGGAGAGG
TACGGGTTTC CTCGCGCGCT CGAAACGTCC TGGGCGAAGA ACAAGTCCGA GAACTTCTAT
CGGATCGTAC GGCTGATGAA GCAGGCCGGT ATGAAGAGCT CGTTCACGCT CGCGCTGCAG
ACGTTGCACG ACGGCGCCCT GGAAGCCATG CGGCGACGGA ACATGAAAGT CAACGAGTGG
GAGGATCTGA CGGCGTGGCT CGCGGATGAG GGGCTGGACT GCTACGCCGA GCTCATCTGG
GGTGCTCCGG GCGAGACCGT GGAGTCGTTC ATGACCGGCT ACGACAAGCT CGCCACTCGG
ATGTCGCGCA TCGCGGTGTA TCCGTTGCTG CTACTACCCA ACACCGACTA CATGGACAAG
AAAGATCAAT TCGGCATTAC CTCGGTTCGC GGCGACGACG ATGACTTCGA GTACATCCTG
GCGCACGACA CCATGACGTT CGCGGAGAAC CAGCAGATGC AGCACTTCAT CTTCTGGGCT
CGTGTCGTCG CCGAGAACGC GGTGCTCCGC CACAGCTGGG TGGCCCTGCG ACGGCTGGCC
GGGCTGAGCC AGTCGCAGGT GCTTCTCAGC CTTGGTGACT GGGTGAACGA GGTCCAGGAT
CCAGCGGCTG ACTTCCTGCG TCAGGCCGAG GCCAAGCTCG GGACTGACGG CCTCGGCGAG
GCCATCAAGT TCATGTTCAC GGATCCCGGC GCCGAGCGAC TGCTCTATCG GTGGTGGGAG
GAGTCGCTGC GCCCTCGCCT GGCCGAGTCC CTGGTACCGC TCCTGCGCGA GGTATTCCGC
TACGACCTGC TCACGCGGCC GGCCTACTAC GGCGACGACG CCGCGGATCA GGAGGTCCTA
CCCGTGGCCG CCGTCGAAGG TGATGAATAT TATGTGCGGC GTAACGTGAG CCTGACCCAC
GACATCCCGG GAATAGTCGC GGCACTGCAT GCCGGCCAGG ATCCGGACCT GTCGCCTTCT
CGCCACGTAC TCGACCTGTA CTACCGGGTG GGATCAGAGT CCGCGGTGAC GTCAACCAAC
CATGAGATCG TGGTGCACTA CATGGGACGC CCGTTGGAGG GAATCGTCAA GGAGAAGACG
CCGGAACGCG CAGGTTGA
 
Protein sequence
MPLESMPLAA GYLKAMATAD DRVKRAMSIE IRNFRGGVNN AYMANAMFTQ GAPDIIAFSV 
LGWNYDSFGA LAQTFKQINP DGWVIFGGNH VSNQAERVFR MYPAVDVVIN GEGEVTFREL
LRAYLDGVNK RDLAAVAGLS YRNAAAETVT TAVRPRIDDL DSIPSPFLNG ALDLTDAAGR
FRYDVALMET NRGCPYKCAF CYWGGATGQK VRAFSIERLR AELELFGKLG VHTIVACDAN
FGILPGDYDF VKTMIQTRER YGFPRALETS WAKNKSENFY RIVRLMKQAG MKSSFTLALQ
TLHDGALEAM RRRNMKVNEW EDLTAWLADE GLDCYAELIW GAPGETVESF MTGYDKLATR
MSRIAVYPLL LLPNTDYMDK KDQFGITSVR GDDDDFEYIL AHDTMTFAEN QQMQHFIFWA
RVVAENAVLR HSWVALRRLA GLSQSQVLLS LGDWVNEVQD PAADFLRQAE AKLGTDGLGE
AIKFMFTDPG AERLLYRWWE ESLRPRLAES LVPLLREVFR YDLLTRPAYY GDDAADQEVL
PVAAVEGDEY YVRRNVSLTH DIPGIVAALH AGQDPDLSPS RHVLDLYYRV GSESAVTSTN
HEIVVHYMGR PLEGIVKEKT PERAG