Gene Sare_4749 details


Gene Information

Locus tag: Sare_4749
Symbol:
ID: 5705340
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5373848
End bp: 5374918
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 72%
IMG OID: 641274147
Product: hypothetical protein
Protein accession: YP_001539493
Protein GI: 159040240
COG category: [S] Function unknown
COG ID: [COG5282] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03624] putative hydrolase
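
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the function name `cds_lengths` is hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not translated into an amino acid.
    """
    gene_length_bp = end_bp - start_bp + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    protein_length_aa = codons - 1  # drop the stop codon
    return gene_length_bp, protein_length_aa


# Start and End values copied from the table for Sare_4749.
gene_bp, protein_aa = cds_lengths(5373848, 5374918)
print(gene_bp, protein_aa)  # 1071 356
```

Under these assumptions the result agrees with the listed Gene Length (1071 bp) and Protein Length (356 aa).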


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.794948
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000155251
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCACAGT TCGTGGACTG GGATCTGGCC GCAGCGACCG CGGGGACGCT CGGCAAGACG 
GGCCCGCGCG TGTCGTACGC CGAAGCCGCG GCGGTGGTCA GCGACCTGAG ACGGTTGACC
GACGAAGCGG CCGGGCACGT CGGCGACTTC ACCGGGCTCC GGTCGCAGGT GTCACACCCG
CCGGTGCGGG TGGTGGATCG CCGGGACTGG GCGGCGACCA ACGTCGCCGG TCTGCGCGAG
GTCATCGGTC CCCTGATCGG TCGCCTCACC GGCGACAAGC AACCCGGCGC GGTGACCGAG
GCGGTCGGCT CGCGGATCAC CGGGGTGCAG GCCGGCACGG TGCTGGCGTA CCTGTCCGGC
CGGGTCCTCG GCCAGTTCGA GGTGTTCTCC GGCGAACCAG GCCAGCTGCT GCTCGTCGCG
CCGAACATCG TCGAGGTGGA GCGGAAGCTG GCGGCGGACC CCCGCGACTT CCGGCTCTGG
GTCTGCTTGC ACGAGGTCAC CCACCGCACC CAGTTCACCG CGGTGCCGTG GCTGCGGGCG
TACTTCCTCG GTGAGGTGCA GGCGTTCGTC GACGCGTCCA ACAGCGGCGC CGACCCCTTG
GTGGAGCGGC TGCGTCGCGG CGTCGCCCTC CTTGCCGACG CGGTGCGGGA ACCGGAGAGT
CGCACCAGCG TCCTGGACAT CGTCCAGACC CCGGCCCAGA AGGCGGTGCT GAACCGGCTC
ACCGCGCTGA TGACCCTGCT CGAGGGGCAC GCCGAGTTCG TGATGGATGG CGTGGGGCCG
CAGGTGATCC CGAGTGTGGA GCGGATCCGG GCGTCGTTCA ACCGCCGTCG GGAGTCGGGT
AACCCCCTGG AGAAGACGGT CCGTCGGCTG CTCGGGGTGG AGGTCAAGCT GCGCCAGTAC
GCCGAGGGGC GGACGTTCGT GCACGGTGTG GTCGACCGGG TCGGCATGGA GGGCTTCAAC
CGGGTCTTTG CCTCCCCGCT GACCCTGCCC CGGCTCGAGG AACTCGGCGA TCCGGACGCC
TGGGTGGCCC GGGTGCACGG GCCGGCCGGT CCGCTTCCGG CCGTCGGCTG A
 
Protein sequence
MAQFVDWDLA AATAGTLGKT GPRVSYAEAA AVVSDLRRLT DEAAGHVGDF TGLRSQVSHP 
PVRVVDRRDW AATNVAGLRE VIGPLIGRLT GDKQPGAVTE AVGSRITGVQ AGTVLAYLSG
RVLGQFEVFS GEPGQLLLVA PNIVEVERKL AADPRDFRLW VCLHEVTHRT QFTAVPWLRA
YFLGEVQAFV DASNSGADPL VERLRRGVAL LADAVREPES RTSVLDIVQT PAQKAVLNRL
TALMTLLEGH AEFVMDGVGP QVIPSVERIR ASFNRRRESG NPLEKTVRRL LGVEVKLRQY
AEGRTFVHGV VDRVGMEGFN RVFASPLTLP RLEELGDPDA WVARVHGPAG PLPAVG