Gene Sare_1027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1027 
Symbol 
ID5708258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1149989 
End bp1151365 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content58% 
IMG OID641270544 
ProductMmgE/PrpD family protein 
Protein accessionYP_001535928 
Protein GI159036675 
COG category[R] General function prediction only 
COG ID[COG2079] Uncharacterized protein involved in propionate catabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0490086 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGGAAC TGGAGAAGAA GCTGGCACGG CATACGCGAA AGTTTGTTGA CCGGCCGATG 
ACGCCGGACG AACTGCAGGT GATCAAGCGT AGTGTCGCTG ACTCTTACGC GGGAATCTGC
GCCTCATTAG TCGACACTAC CATCCTGCGT AAATTCAGTA AGGTCGTCAC GGGCCCTGGG
GCGGGATCCG GAACTCCGGT CTGGGGTGTT GGACGGGAAT CCAGTATAGA CGATGCCGTT
TTTCTCAACG CTATTCTGGC TCGCAGGAGT GATCTACTTA ACACGTATGT CTCACCCACC
GCGATGGGCA TCGTGCATCC GTCCGACAAC GTGGCCCTCG CGTTGGTTCT TGCCGATTGG
TTGAGATGGA CCGGTAAACA GTTCCTCGCT TCGGTAAATG TTTTGTTCAA TCTTTCCGCC
CGATTCGCCG ATAGTTATGA TCCTGAGGCG AGTGGCTTTG ACCATGATGC CGCCGCCACT
TTCTGGGTCG CACTCGCCGT AGGACAGGCG CTCGGCCTCT CCGAGGCTCA GCTTGTCGAG
GCACAACGCA TCGCCGGCGA GTTCGGGCTC ACCGCCAATC AGGCTGCGGT AGGCGACATC
ACTGATTGGA AGCACTGTAC CTACGCGTCC AGCGCCCTGC GGGGCCTACA GGCCGCCAGG
CTGGCCCGGG CGGGGTTCAC AGGGCCAGCC TCAATATACC AGGGTAAGTT CGGCGTGAAT
CGATTCTACA GAAGTGCCGA AATGGCGTTC GATGTTGAGC CCGACCTCAA TAGGATCATC
TTTAAGCGGT GGCCGGCGCT CTTCTACTGC CAAACTCCAA TTGACGTCGC ACGTGATCTG
TCCTCCAACA TTAGCGAGGC CTCGGATATC CGACAGGTGA AGGTGGAGAC CTACGATCGG
GCCCTACGAA ATGGCGCCAC ATCATCAGCC GACAACCCCG CCAGTCGGGC GGGCCGCACA
CACTCTATTG CGTACTGCGT CGCCACTGCG CTTCTCAAGC CTGTCGAGTA CGCCGACTTC
GACGCGGATC GCGCACGAGA CCCTCAACTC CAGCGGCTGT TGGGCGCGAT CAGCGTCATG
GAGGACTCGA CTATGACCAA GAAATTCCCA TCCTGCACAC CATGTCGGAT ATCGATTACC
TTAGAAAATG GCGAGGTCAT ACGGCAGGAA CGCGACTACT CGCACGGCGA CCCCAGGGAC
CCCCTGTCTC GCGACGAAAT TTCGGACAAG GTACGTAGAA ACCTTACGGG CCTGGCGAGC
ACTTTCAGCA AGAACAAGAT CATCTCCTGC CTATGGGGTG CGGAGAAGCT CGATGGGTTG
GCGGCCCTAC GGGCTCCGTT AGAACAGGAC CGGACAAAAG GGAGCGTATG GGAGTGA
 
Protein sequence
MGELEKKLAR HTRKFVDRPM TPDELQVIKR SVADSYAGIC ASLVDTTILR KFSKVVTGPG 
AGSGTPVWGV GRESSIDDAV FLNAILARRS DLLNTYVSPT AMGIVHPSDN VALALVLADW
LRWTGKQFLA SVNVLFNLSA RFADSYDPEA SGFDHDAAAT FWVALAVGQA LGLSEAQLVE
AQRIAGEFGL TANQAAVGDI TDWKHCTYAS SALRGLQAAR LARAGFTGPA SIYQGKFGVN
RFYRSAEMAF DVEPDLNRII FKRWPALFYC QTPIDVARDL SSNISEASDI RQVKVETYDR
ALRNGATSSA DNPASRAGRT HSIAYCVATA LLKPVEYADF DADRARDPQL QRLLGAISVM
EDSTMTKKFP SCTPCRISIT LENGEVIRQE RDYSHGDPRD PLSRDEISDK VRRNLTGLAS
TFSKNKIISC LWGAEKLDGL AALRAPLEQD RTKGSVWE