Gene Sare_3863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3863 
Symbol 
ID5705894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4394559 
End bp4396544 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content68% 
IMG OID641273284 
Productradical SAM domain-containing protein 
Protein accessionYP_001538646 
Protein GI159039393 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.721561 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0046189 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTGCCC CGTCCACCAC GCCGCGCAGC GCTGCGGCCA ACTCCGTGTG GCCCCGGTTG 
GAGCCGTTGC TGCCCCAGAT CTCCAAGCCC ATCCAGTACG TCGGTGGTGA GTTGGGGGCG
GTGGTCAAGG ACTGGGACAC GGCGGCCGTG CGCTGGGCCC TGATGTACCC CGACGCATAC
GAGGTCGGCC TGCCGAACCA GGGTGTGCAG ATCCTGTACG AGGTGCTCAA CGAGCTGCCC
GACGTGCTCG CTGAGCGGAC GTACGCGGTC TGGCCGGACC TGGAGCGGCT GATGCGTGCC
CACCAGGTGC CGCAGTTCAC CATCGACGCG CACCGTCCGG TGCGTGAATT CGACGTGTTC
GGCGTCTCCT TCGCCACCGA GCTGGGCTAC ACGAACCTGC TCACCGCGAT CGACCTGGCC
GGCATGCCCC TGCTGGCCGC CGACCGTACC GACGCCGATC CGGTGGTCGT CGCCGGCGGG
CACGCCGCGT TCAACCCGGA GCCGATTGCC GACTTCGTCG ACGCCGCCGT GCTCGGGGAC
GGCGAGGAGG CCGTCCTGGA GATCACCAGG ATTGTCCGGG AGTGGAAGGC CGAGGGCTTT
CCCGGCGGTA GGGACGAGCT GCTGCTGCGG CTGGCTCGCA CCGAGAGCGT CTACGTGCCG
CGCTTCTACG ACGTGGACTA TCTGCCCGAC GGCCGGATTC AGCGGATCGT GCCGAACCGC
CCGGATGTGC CGTTCCGGGT GCACAAGCGC ACGACGATGG ATCTGGACGC GTGGCCGTAC
CCGAAGAGGC CGCTGGTGCC GCTGGCGGAG ACCGTCCACG AGCGGTACGC GGTGGAAATC
TTCCGGGGGT GCACCCGGGG CTGCCGGTTC TGCCAGGCGG GGATGATCAC CCGACCGGTG
CGGGAACGTT CGATCACCAC AGTCGGACAG ATGGTGCGGG AAGGGCTGGA GTTCTCCGGC
TTCCACGAGG TGGGCCTGCT GTCGCTGTCG TCGGCCGACC ACTCGGAGAT CGGCGACATG
TGCTCTGGGC TGGCCCAGCA GTACGAGGGC ACCAACGTCT CGCTGTCGCT GCCGTCGACC
CGAGTGGACG CGTTCAACAT CGAGCTGGCG CAGGAGCTGT CCCGCAACGG CCGACGGACC
GGTCTGACCT TCGCCCCGGA GGGCGGGTCG GAGCGGATCC GAAAAGTGAT CAACAAGATG
GTGTCGAAGG ACGACCTCAT CCGAACCGTG GTCACCGCGT ACACCAATGG CTGGCGGCAG
GTGAAGCTCT ACTTCATGTG CGGGTTGCCC ACCGAGACCG ACGAGGACGT CCTCGAGATC
GCGGCGATGG CGCACGAGGT CATCCGGGCC GGCCGTGCGG CGACCGGTAG CAGGGACATC
CGCTGCACGG TCTCCATCGG TGGATTCGTG CCGAAGCCGC ACACCCCGTT CCAATGGGCG
GCCATGGCGC GGCCGGAGGT CATCGACAAC CGCTTGCGGC TGCTCAAGCA GGCAGTCAAT
GCGGACCGTT CGCTGGGTCG GGCGATCGGA TTCCGATACC ACGATGGCGA GCCGTCGCTG
ATCGAGGGGC TGCTCTCCCG CGGTGACCGT CGGGTCAGTG CGGTGATCCG ACGGGTCTGG
GAGAACGGGG GCCGGTTCGA CGGTTGGAGC GAGCACTTCT CGTACCAGCG TTGGGTGGAC
GCCGCTGCCG AGGTTCTGCC CGGCTTCGGG ATCGACCTTG ACTGGTACAC CACCCGGGAA
CGTGACGAGC TGGAGGTCCT ACCCTGGGAC CACCTGGATT CGGGTCTCGA CAAGGACTGG
CTCTGGCAGG ACTGGCAGGA CGCCCTGGGC GAGTATGAGC AGGACGACTG CCGCTGGACG
CCGTGCTTCG ACTGCGGCGT CTGTCCGTCC ATGGACACCG AGATTCAGAT CGGCCCAACG
GGTAGGAAGC TGCTCCCGCT GACGCCGGTC AACGGGCTTC GGGTGCCCAG CGGAGCCCCG
CAGTAG
 
Protein sequence
MSAPSTTPRS AAANSVWPRL EPLLPQISKP IQYVGGELGA VVKDWDTAAV RWALMYPDAY 
EVGLPNQGVQ ILYEVLNELP DVLAERTYAV WPDLERLMRA HQVPQFTIDA HRPVREFDVF
GVSFATELGY TNLLTAIDLA GMPLLAADRT DADPVVVAGG HAAFNPEPIA DFVDAAVLGD
GEEAVLEITR IVREWKAEGF PGGRDELLLR LARTESVYVP RFYDVDYLPD GRIQRIVPNR
PDVPFRVHKR TTMDLDAWPY PKRPLVPLAE TVHERYAVEI FRGCTRGCRF CQAGMITRPV
RERSITTVGQ MVREGLEFSG FHEVGLLSLS SADHSEIGDM CSGLAQQYEG TNVSLSLPST
RVDAFNIELA QELSRNGRRT GLTFAPEGGS ERIRKVINKM VSKDDLIRTV VTAYTNGWRQ
VKLYFMCGLP TETDEDVLEI AAMAHEVIRA GRAATGSRDI RCTVSIGGFV PKPHTPFQWA
AMARPEVIDN RLRLLKQAVN ADRSLGRAIG FRYHDGEPSL IEGLLSRGDR RVSAVIRRVW
ENGGRFDGWS EHFSYQRWVD AAAEVLPGFG IDLDWYTTRE RDELEVLPWD HLDSGLDKDW
LWQDWQDALG EYEQDDCRWT PCFDCGVCPS MDTEIQIGPT GRKLLPLTPV NGLRVPSGAP
Q