Gene Sare_4482 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4482 
Symbol 
ID5706922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5067785 
End bp5069827 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content70% 
IMG OID641273896 
Producthypothetical protein 
Protein accessionYP_001539245 
Protein GI159039992 
COG category[R] General function prediction only 
COG ID[COG3973] Superfamily I DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGTCG ATCTCGACGC CGAACTCGAA GCCGAACGAA CACACATGGC CACGTCGCGG 
GCGGCCCTGC ACGCGATGCG CCTGCACGCC GAAGCCATGT ACGCCATCGG CGCGAAGATC
GCCGGTGACC CCTTCTCCGC CGAGACCCTT GGTGTCGCGC TGGCCCGGCG GGTGGCCGAA
CTCGCCGACG ACCCCCGTAC GCCGCTGTTC TTCGGCCGCC TCGACCTCGA TGGCGCTCTG
GTGCCCGCGG ACGAGCCATT GACCGGACGG TTTCACATCG GACGTCGGCA CGTCACCGAC
CAGCTCGGCG ATCCGCTGGT GCTGGACTGG CGGGCCCCGG TCAGCCGGGC CTTCTACCAG
GCCAGCGCCC GCGACCCGCA GGGCGTGCAG ACGCGGCGGC GGTTCGGGTT CGCCGGCGGC
GACCTGACGA GCTTCGAGGA CGAGCGCCTC GACCGGGGCG AGGAACTCGG CACCGCGAGC
AAAATCCTTA TTTCGGAGAT CGAGCGCCCG CGCGTCGGGC CGATGCGCGA CATCGTCGCC
ACGATCCAAC CCGAGCAGGA CGAGCTGGTT CGCGCTGACC TCGACACCTC GATCTGCGTG
CAGGGGGCGC CAGGAACAGG AAAGACTGCG GTGGGCTTGC ACCGGGCCGC GTACCTGCTG
TACCTGCACC GCGAGCGGCT TCGGCGCACC GGTGTGCTGA TCCTCGGACC GAACCAGGCG
TTCCTCAGCT ACATCGCCGC CGTCCTGCCC GCTCTCGGCG AGTTGGAGGT CAAGCAGTCC
ACGCTGGAAG GGCTTATCGG CCGGATGCCC GTCTCGGCCG TCGATCCGGT CGAGACCGAG
ATCGTCAAGC ACGATCCTCG GATGGCGACC GTACTGCACC GGGCGCTGTG GTCGCGGTTG
AGCCGAGCCG AATCTCCGAT CCAGGTCTCC GACGGCGCCT ACCGGTGGCG CATCGACCCG
GGAGTGCTGA CCAGGCTGGT CGACGACACG CAGCGGGAGA ACCTCGCCTA CCTGCTGGGC
CGCGAACGCG TACGCGCCCG GGTCGTCGGG TTGCTGCAAC GGCAGGCCGA GACGCGGTCG
GGCGAGTCAC CCGGTGAGCC GTGGCAGCGG CGGATGGGCA AGACCAAGCC GGTCGTCGAG
TTCCTCGACG CCTGCTGGCC GGCGATGACG CCGGAGTCGC TGGTCTTCGA GCTGCTCAGC
GACCCTGCGG CACTGGCCCG TGCCGCCGAG GGCATCCTCA CCGAGGACGA GCAGCAGGCG
CTCCTGTGGC GCAAGCCCTA CCGGACCGTC AAGAGCGCCA AGTGGAGCAG TGCCGACCTC
GTGCTGCTCG ACGAGGCCGC CGGGCTGCTC GAACGCGAGA CCAGCCTCAG CCACATCATC
ATCGACGAAG CGCAGGACCT CTCACCGATG CAGTGCCGGG CCATCGCGCG GCGCACCACA
CATGGATCGA TCACGCTGCT CGGCGACCTC GCCCAGGGCA CGGCACCGTG GGCGGCGACG
GACTGGACCG ACTTGCTGCG GCACCTGGGT AAGCCCGATG CGCCGGTCGT GCCGCTGACC
ACCGGGTTCC GCGTACCCGA GGCGGTCGTG GCCCTGGCCA ACCGGCTGCT ACCGGCCCTG
GCAGTGAACG TGCCCCCGGC AGTGTCGCTG CGCCAGGACG GCAACCTCAT CATCCGCAGC
GTCGACGACC TGGCCGCAGC CGTGGTCGCG GAACTGGACG CAACGCGCTC GCTACCCGGG
TCGGTCGGAG TCATCGCCGC CCACCGGCAC CTCGACGGAC TGCGGGCCGC GCTCGCCTCG
GCCGGAGAGC AGCCCGGCGA ACTCGACGAC CCGGAGAGCA ACCGCGTCAC GGTGGTCCCG
GCGACCCTTG CCAAGGGCCT GGAGTACGAC CACGTCATCG TCGTCGAACC GGCGGACATC
GTGGACGCGG AACCGCGCGG GCTGCACCGG CTCTACGTTG TGCTCACCCG CGCCGTGACA
CGGCTAACGG TGCTGCACGC CCGCGCGCTG CCCGAAGCAC TGATTGTCGC CGGTCAGGGC
TGA
 
Protein sequence
MLVDLDAELE AERTHMATSR AALHAMRLHA EAMYAIGAKI AGDPFSAETL GVALARRVAE 
LADDPRTPLF FGRLDLDGAL VPADEPLTGR FHIGRRHVTD QLGDPLVLDW RAPVSRAFYQ
ASARDPQGVQ TRRRFGFAGG DLTSFEDERL DRGEELGTAS KILISEIERP RVGPMRDIVA
TIQPEQDELV RADLDTSICV QGAPGTGKTA VGLHRAAYLL YLHRERLRRT GVLILGPNQA
FLSYIAAVLP ALGELEVKQS TLEGLIGRMP VSAVDPVETE IVKHDPRMAT VLHRALWSRL
SRAESPIQVS DGAYRWRIDP GVLTRLVDDT QRENLAYLLG RERVRARVVG LLQRQAETRS
GESPGEPWQR RMGKTKPVVE FLDACWPAMT PESLVFELLS DPAALARAAE GILTEDEQQA
LLWRKPYRTV KSAKWSSADL VLLDEAAGLL ERETSLSHII IDEAQDLSPM QCRAIARRTT
HGSITLLGDL AQGTAPWAAT DWTDLLRHLG KPDAPVVPLT TGFRVPEAVV ALANRLLPAL
AVNVPPAVSL RQDGNLIIRS VDDLAAAVVA ELDATRSLPG SVGVIAAHRH LDGLRAALAS
AGEQPGELDD PESNRVTVVP ATLAKGLEYD HVIVVEPADI VDAEPRGLHR LYVVLTRAVT
RLTVLHARAL PEALIVAGQG