Gene Sare_0010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0010 
Symbol 
ID5707391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp14087 
End bp15292 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content67% 
IMG OID641269534 
ProductXRE family transcriptional regulator 
Protein accessionYP_001534937 
Protein GI159035684 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.141867 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00323142 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGCCGA GGCACGACCC GATACCGCCC GGTGGGCGGA TCAAGCTCTA TCGCAGGCGG 
CGCGGGCTCA CCCAAGAGGT GTGCGCCCAA CTCAAGGGTG TCAGCGTGGG CGCCTGGCGG
AAGTGGGAAT CCGGGGAGCG GTCGGTCAAC AGTCTGGCTG ACTGGATCGA GATTGCTCGG
ATTCTCAATG TTCGTGACCT CTACAAACTC ACCGGCCATC CGCTCGGGGT ACTGCCCGAT
GACCCGGCCG AACATGAGTC GGTGCCGCCG CTACGGGCGG CGATGACCGC CTACGCGCCT
GATATTGACC AGCTATCAAG CGTCGCCGAG CTACGGTCGG CAGTCAGACT GGCGTGGACG
ACGTGGTACC AGTCGCGGCA GCGCTACACC TACACCAGCC CGGTCCTACC GGGCCTGGTC
CACGCATCCC GGGCCGCGGT TGCCAGCCTC GACGGCAACG AGCGACGGCA AGCCCAGCGG
GTCGCGGCCG ACCTGTACCT ACTCGTCCGG GCGTTCGCGA AGAAGGTCGG CGCGCAGGAT
CTCGCGGTCA TCGCCGCCGA CCGGGCGCTG ACAGCCGCAT ACGAGGCCGA TGATCCCGCC
TACCGGGCGT CCGCGGCGTG GAACATGGGC CAGGTGCTCT CCAATCGAGG GCATACCGAG
GATGCGGTGG ACATGTGCCG GCAGGCCATC GCCGACCTGC AACGCACCGC CGACGACGAC
CCTGTTCGCC TCGCGGCACT CGGCGGGCTG CACCTGCTGC TGTCCATCCA GTACGCCCGA
CTGCGAGACG AACGACGAAC CCTTGACGTG CTTGATCGTG CCGACGAACT TGCGGCCCGC
ACCGGGGAGA CAGAGCACCA TTTCATCTTC TTCGGTCCGA CCAATACGGC GATTCACCGC
GCCGCCGCAA CACTGGAACT ATCCCGACCG GGGGAGGCGG CGCGAATCGC GGAGCGGGTC
GACGTGGGTC GGTCGCCGTC GATTGAGCGG CGGCACTCCC ATCTCACTCA CCTGGCGCGG
GCGTACGCCA GCAAGCGCGA TGACTACGCC GCGATCCACA TGTTGCAACG CGCGCACCGG
GAGTCCCCCG AGGAATCATC GTTGAACCTA CTCATGCGGG GGACGGTACG GGAGCTGCTG
ACCAGGGAGA CTGCCACCAG TCGGGACGAA TTGCGCGGTC TCGCGGAGCT GGTAGGCGTG
GTCTGA
 
Protein sequence
MEPRHDPIPP GGRIKLYRRR RGLTQEVCAQ LKGVSVGAWR KWESGERSVN SLADWIEIAR 
ILNVRDLYKL TGHPLGVLPD DPAEHESVPP LRAAMTAYAP DIDQLSSVAE LRSAVRLAWT
TWYQSRQRYT YTSPVLPGLV HASRAAVASL DGNERRQAQR VAADLYLLVR AFAKKVGAQD
LAVIAADRAL TAAYEADDPA YRASAAWNMG QVLSNRGHTE DAVDMCRQAI ADLQRTADDD
PVRLAALGGL HLLLSIQYAR LRDERRTLDV LDRADELAAR TGETEHHFIF FGPTNTAIHR
AAATLELSRP GEAARIAERV DVGRSPSIER RHSHLTHLAR AYASKRDDYA AIHMLQRAHR
ESPEESSLNL LMRGTVRELL TRETATSRDE LRGLAELVGV V