Gene Sare_4993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4993 
Symbol 
ID5705733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5662171 
End bp5663448 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content67% 
IMG OID641274386 
Producthypothetical protein 
Protein accessionYP_001539727 
Protein GI159040474 
COG category 
COG ID 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000545579 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAAACAC GCACAGAAAT GGGCGCCGCC GGCGCCGCCG GAACCGCCGC GGTGGCCGCC 
GCCACCGCGT TCGCGATGAT GGGCAACGCG GTCAGCGCCG CGCCTGTCGG CCCGATCAAC
CCCGTCCTGG CCACCGAAGG GTTCCTGGTC ATGACGGAAG GTGACGCGAA CCTGATCGGC
GCGGAGAACG AGGGGACGCT GGCGGTCGGC GGAAACCTCA CCTTCGCGCA ATATCAACTG
GCAACAGTGA GCGCGGGGAA CTTCGTCGAT CTTGGCGACA ACAACCCCAC CGCGCTCGTG
GTCAACGAGC GGGTGGACTT CGTCAACAGC ACGCCCGGCA CCCGGCTCCA GGTTCTCCAG
AACGGCTACG TCAAGATCAG AAATCTGACC GACACCTTCG TGCGGAACGT CGACAACAAC
AACGCGGCGG TGAACACGCG GGTCCTGCCG GTCGACGACT ACGACGCCTT TCCACGAGTC
GAGCTCGTCA CCCGTCAGCC GGTGTCGAGT GTCGGGCCGG CGTTCGTGCT CGACATCGCC
GCGGCGTTCC AGTCCTTCCG GTCGGCCTCC ACCGAACTCG CGACATGCCA GAACACCGTG
GCGCTGCGAG ACCCGAACGG CGACCCGATT CCCACCCCCA TCCCGCCCGG GACCAACGCC
GTCATCGACC TGAGCCCGAA CACCACGAAC GTGCTCAATA TCAGTGCGAC CGACCTGGAC
AACATCAGCG TCCTGACCTT CGCCGACCAG CCGACGGCGA GCAGCCCGTT GCTGGTCAAC
GTCAACACCA CTGACGTGGG AGACTCCTTC TCCTGGACCT CGCCGACGTT CGCGGGGGTC
GGAGGTCCAG AGGCCCGCTT CATCCTGTTC AACTTTCCCA CCGCGACCTC GCTCACCCTC
GCGGCCGGCG GCGCCACCGT CGAGGGAACG ATCTACGCGC CACGCGCTGA CTACACCGAC
CTCGACCAGA GCAACACCGA AGGGAACATC ATCGCGCGCA CGTTGGCGCA CCGTGGCGGG
GAGATCCACG ACTACCCCTT CAGCACCACC CTCGCCTGCA GCAATGGAAC CCCCACGCCC
ACACCGAGCG TCACACCCAC AGTGACACCG ACCGCCACGC CCACGGAGAC ACCGAGTCCG
ACCAGAACCG CGTTGCCGAT CACCGGCTCC TCCGGCGGGT CCATGCTGTT CGGCGGCACC
CTGGCCGTGC TGGCGGGCGC CGTGCTTCTC GCCACACTCG GTTTCGCCCA CTGGCGACGC
CGCAACCGGC ACAGCTGA
 
Protein sequence
MKTRTEMGAA GAAGTAAVAA ATAFAMMGNA VSAAPVGPIN PVLATEGFLV MTEGDANLIG 
AENEGTLAVG GNLTFAQYQL ATVSAGNFVD LGDNNPTALV VNERVDFVNS TPGTRLQVLQ
NGYVKIRNLT DTFVRNVDNN NAAVNTRVLP VDDYDAFPRV ELVTRQPVSS VGPAFVLDIA
AAFQSFRSAS TELATCQNTV ALRDPNGDPI PTPIPPGTNA VIDLSPNTTN VLNISATDLD
NISVLTFADQ PTASSPLLVN VNTTDVGDSF SWTSPTFAGV GGPEARFILF NFPTATSLTL
AAGGATVEGT IYAPRADYTD LDQSNTEGNI IARTLAHRGG EIHDYPFSTT LACSNGTPTP
TPSVTPTVTP TATPTETPSP TRTALPITGS SGGSMLFGGT LAVLAGAVLL ATLGFAHWRR
RNRHS