Gene Sare_4112 details


Gene Information

Locus tag: Sare_4112
Symbol:
ID: 5707663
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4672401
End bp: 4673684
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 71%
IMG OID: 641273540
Product: hypothetical protein
Protein accession: YP_001538893
Protein GI: 159039640
COG category: [S] Function unknown
COG ID: [COG5282] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03624] putative hydrolase
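
The length fields above follow from the coordinates. The short sketch below checks that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(4672401, 4673684)
print(length)                     # 1284
print(protein_length_aa(length))  # 427
```

Both values agree with the Gene Length and Protein Length fields of this record.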


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.7351
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0620226
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCTGATA TTCCGTTCGG TTTCGCGCTC CCGGGTGGGC AACCACCAGA CCCCAACGAT 
CCCGCGCAGA TGCAGCAATT CATGACCCAG TTGCAGCACC TGCTCTCCGC ACCCGGTAGC
GGACCGGTGA ACTGGGACCT GGCGCGCCAG GTGGCCGCGA GCCAACTCAG CGCCGGGGGC
GACCCGGCTG TCTCGCCGTA CGAACGCAAT GCGGTGGAGG AGGCGCTGCG CCTGGCCGAT
CACTGGCTGG AGCCGGCCTC GGCACTCCCA TCGGGAATCC ACACCTCGAT GGCATGGAAC
CGCAATGAGT GGATTTACAA AACCCTCGAT GTCTGGCGCA AGCTGTGCGA CCCGGTGGCC
AGCAGGATGG TCGGTGCGAT GGGTGACCTG GTGCCGCCGG AGGCCCGGGC CCAGCTCGGG
CCGATGCAGT CGATGGTGGC CACCCTCGGC GGTGCGCTCT TCGGGGGCCA ACTGGGCCAA
GCCCTCGGCT CCCTCGCCGC CGAGGTGCTC TCGGCTGGCG ACATCGGGTT GCCACTCGGC
CCAGCCGGCA CGGCCGCGCT CATCCCGGCC AACATCCGGG CCTACGGTGC CGGGCTGGAA
CTGCCCGAGG ACGAGGTACG CCTCTACGTG GCGCTACGCG AGGCCGCTCA CCAGCGACTC
TTCGAACACG TGCCGTGGCT GCGCGGACAC GTGCTCAACG CGGTGGAGAT GTACGCCTCG
GGTATCCGGG TCAACCGCGA GGCGATCGAG GAAGCGATGG GCCGAGTCGA CCCGACCGAC
CCAGAGTCGA TGCAGGCGAT CGCGCTCGAG GGCATCTTCA CCCCGGAGGA CAGCCCGGCC
CAGAAGGCGT CACTGGCCCG GCTGGAGACG GCGCTCGCCC TCGTCGAGGG TTGGGTCTGC
CACGTGGTGG ACAGCGCGGC CGGAGGGCGG CTGCCCAACG TCGTCCGACT CGGTGAGGCG
TTCCGCCGGC GGCGGGCCGC AGGCGGTCCG GCCGAACAGA CCTTCGCCGC CCTGGTCGGC
CTGGAGTTGC GCCCACGCCG GCTACGGGAG GCGGCGGCGC TCTGGGCGGC CCTCGCCGAG
CACCGGGGGA TTGCCGGCCG GGATGCGTTG TGGGGTCACC CCGACCTACT ACCGTCCGAC
GACGACTTCG CCGACCCGGT GGCCTTCGCC CAGTCCCGGC TCGACGCCGG CGAGCTGGAG
GGCTTTGACT TCAGCGCACC TGGTGGCCCG CCGGAGCAGG CTCCGGGCGA GGCCGACGGG
GAGGAACCGC CCGCCACCCG CTGA
 
Protein sequence
MPDIPFGFAL PGGQPPDPND PAQMQQFMTQ LQHLLSAPGS GPVNWDLARQ VAASQLSAGG 
DPAVSPYERN AVEEALRLAD HWLEPASALP SGIHTSMAWN RNEWIYKTLD VWRKLCDPVA
SRMVGAMGDL VPPEARAQLG PMQSMVATLG GALFGGQLGQ ALGSLAAEVL SAGDIGLPLG
PAGTAALIPA NIRAYGAGLE LPEDEVRLYV ALREAAHQRL FEHVPWLRGH VLNAVEMYAS
GIRVNREAIE EAMGRVDPTD PESMQAIALE GIFTPEDSPA QKASLARLET ALALVEGWVC
HVVDSAAGGR LPNVVRLGEA FRRRRAAGGP AEQTFAALVG LELRPRRLRE AAALWAALAE
HRGIAGRDAL WGHPDLLPSD DDFADPVAFA QSRLDAGELE GFDFSAPGGP PEQAPGEADG
EEPPATR
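
The relationship between the gene and protein sequences can be checked with a small script. The sketch below is illustrative only: it uses the standard codon assignments (which translation table 11 shares for elongation codons) and assumes that the first codon is an initiator and is therefore reported as M, which is why the GTG start of this gene appears as methionine. The names `clean`, `gc_fraction`, and `translate_cds` are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Standard genetic code in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        c1, c2, c3 = seq[i:i + 3]
        aa = AMINO_ACIDS[16 * BASES.index(c1) + 4 * BASES.index(c2) + BASES.index(c3)]
        if i == 0:
            aa = "M"  # assumption: the first codon is an initiator (e.g. GTG) read as Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "GTGCCTGATA TTCCGTTCGG TTTCGCGCTC"
print(translate_cds(first_blocks))  # MPDIPFGFAL
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate_cds` would allow the GC content and Protein Length fields to be checked in the same way.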