Gene Sare_4151 details


Gene Information

Locus tag: Sare_4151
Symbol:
ID: 5708308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4718651
End bp: 4719751
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 66%
IMG OID: 641273579
Product: fumarate reductase iron-sulfur subunit
Protein accession: YP_001538932
Protein GI: 159039679
COG category: [C] Energy production and conversion
COG ID: [COG0479] Succinate dehydrogenase/fumarate reductase, Fe-S protein subunit
TIGRFAM ID: [TIGR00384] succinate dehydrogenase and fumarate reductase iron-sulfur protein
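
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon (assumed to be counted in the gene length) encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(4718651, 4719751)
print(length)                     # 1101
print(protein_length_aa(length))  # 366
```

Under these assumptions the computed values match the Gene Length (1101 bp) and Protein Length (366 aa) fields.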


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0159941
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00206659
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGAACTG AGAAGAGCCA GGCGGCTGGT GAGCCGGCCA CGAAGCGGCA GTTCCGCATC 
TGGCGGGGCG ACGAGACCGG CGGCGACCTG CAGGACTACC TGGTCGAGGT GAACGAGGGC
GAGGTGGTAC TGGACGTCAT CCACCGGCTC CAGAGCACCG ACGCGCCGGA CCTCGCCTGC
CGTTGGAACT GCAAGGCCGG CAAGTGCGGG TCCTGCTCGG TCGAGATCAA CGGTAAACCG
AAGCTGGCGT GCATGACCAG AATGTCAACG TTCACCGAGG ACGAGACCAT CTCGGTCACG
CCGCTGCGGA CGTTCCCGAT CGTTCGGGAT CTGGTCACCG ACGTCTCGTT CAACTACGAG
AAGGCACGTG AGACGCCGGC CTTCGCACCG CCGCCCGGTG TCACCCCGGG TGACTACCGG
ATGCAGCAGG TCGATGTCGA GCGCTCGCAG GAGTTCCGTA AGTGCATCGA GTGCTTCCTG
TGCCAGACGG TCTGTCACGT GATCCGGGAC CACGAGGAGA ACAAGCCGGC TTTTGCCGGA
CCGCGGTACT TCATCCGGGC GGCCGAGTTG GACATGCACC CGCTGGACAC GCGGGACGAC
CGCAAGGAGT ACGCACAGGC CGAGCAGGGC TTGGGCTACT GCAATATCAC CAAGTGCTGC
ACCGAGGTCT GCCCCGAACA CATCAAGATC ACCGATAACG GGATTATCCC CATGAAGGAG
CGGGTAGTCG ACCGCAAGTA TGATCCCCTA GTGTGGCTTG GTAGCAAGAT CTTCCGGAGG
GGTCAGGTGC CTCAGACCAG CGTGACCAGC GAGCATTCCC CGGGCGCCGT GCACACCCGC
GCGGCCGGTC CGCCGGCGGT CCACTCGCAC GCGGGAGGGT CGCACGACCC ACAGGCCGAG
GCCCAGGCGC AGGCGGGCGT CAACTGGCAC CGCGAGGTGC CGAAGCCGAC CGCACCGGCG
GTCGACGCGT CCGGCAAGCT TCCGCTGACC GAGCTCACCT TCGATCGGGC GGCGGCGCCG
TCACCGTTCG GCGACGACGT GAGCTTCCCA CTGCCGCCCG AACATCTGAA CTTCGCCCAC
CCGGAGCAGG ACAAGCACTG A
 
Protein sequence
MGTEKSQAAG EPATKRQFRI WRGDETGGDL QDYLVEVNEG EVVLDVIHRL QSTDAPDLAC 
RWNCKAGKCG SCSVEINGKP KLACMTRMST FTEDETISVT PLRTFPIVRD LVTDVSFNYE
KARETPAFAP PPGVTPGDYR MQQVDVERSQ EFRKCIECFL CQTVCHVIRD HEENKPAFAG
PRYFIRAAEL DMHPLDTRDD RKEYAQAEQG LGYCNITKCC TEVCPEHIKI TDNGIIPMKE
RVVDRKYDPL VWLGSKIFRR GQVPQTSVTS EHSPGAVHTR AAGPPAVHSH AGGSHDPQAE
AQAQAGVNWH REVPKPTAPA VDASGKLPLT ELTFDRAAAP SPFGDDVSFP LPPEHLNFAH
PEQDKH
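
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such blocks might be cleaned, translated, and checked for GC content is shown below. It assumes a plain A/C/G/T sequence and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for non-start codons. All names here are hypothetical helpers, not part of any database tooling.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGGGAACTG AGAAGAGCCA GGCGGCTGGT GAGCCGGCCA CGAAGCGGCA GTTCCGCATC"
last_line = "CCGGAGCAGG ACAAGCACTG A"
print(translate(first_line))  # MGTEKSQAAGEPATKRQFRI
print(translate(last_line))   # PEQDKH
```

The two printed fragments agree with the start and end of the protein sequence listed above. Running `gc_percent` on the full gene sequence would give a figure to compare against the GC content field.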