Gene Sare_2638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2638 
Symbol 
ID5706901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3004413 
End bp3006083 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content70% 
IMG OID641272098 
Productparallel beta-helix repeat-containing protein 
Protein accessionYP_001537468 
Protein GI159038215 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0177444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.259546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCACC AGGGTCTGTC CACTGCGGTT CCGAAGCGGC TCACGCCGGT GGCCGGCGCC 
CCGCTGTGGT GCGTGGCCAC CGAGCACGGG CTGCGCGGCG ACGGCGTCAC CAACGACCAG
CCGGCCCTCG CCGCCCTCGT TGACCTGCTC GGTGAGGGGT ATGCGGCCGA CGGGCGAGCG
CGGGTCATCT ACTGCCCGCC GGGCATCTAC TCGATCCGGG ACGCCGGCAC GGTGTGGCGC
AGCGGGGTGT CGCTGATCGG CGCCGGGCCG GCGGCGACCC GGTTTTTGCT CAGCAACGAG
GGGGAGCGGT CGGCGCCGGT GCCGTTGGCC TTCTGGACGA CGGTGCAGCA CGGCGCGGAC
CGGGACCGGC ACATCGCCGA GGTCACCTTC GCCGACTTCG AGATCGACGG CTCCGGGGTG
GCGATGACCG AGTACTCCTA CCTTGCCAAA GGCCTGGGCT TGCAGTACGT GGTGCGTGGG
GTGTTCCGCA ACCTCTACAT CCACCACACC GGTGCCACCG GGCTGGGCTG TGACTTCCTT
CAGGACACGT TGATCGAGGG GTGTGTGGTG GTCGGCTGCG GCCGACTGGA CAACGGCCTT
CAGATGGGCG GCGCGGGCAT CGGCATCGGC GTTGGGGGCT GGGGTGAGGT CGAGCGGTGC
ACGATCGCCA ACTGCACCAC GGTCGGCAAC GGCACCAACG GGATCTTCCT CGAGTTGCAG
AAGTCGTACT GGATCCCGCC GCGTGGCTAC CGGATCGTCG GCTGCCACAG CCAGGCCAAC
CGATTCGGCA TCTCCGACTG GGGCGCCGAC GGCCTCGTGG TCACCTCCTG CACCCTCACC
TCGAACCTGG AGGCGGGCTT CGACGTCTCC GCCAACGGCA CCGCCAGCGT CGCCGGCCGT
GGCGGCATCC TCGCCGACTG TGTCATCGAC CGCAACATCG GCGACGGCAT CAGCATGGGG
AACACCCCGG GTCCGTACAC CATCCGGGGC AACCGGATCA GCGGCAACGG TGGGTACGGC
TACCACGAGC ATGACCTCGG CAACGGCTTC CGGGGCCCGT CCGCGTCCGT GGTGATCGAG
GGCAACGACC TCGCCGACAA CGCCTTGGAC GCGATCCGGA TCGACCGGCC GATGGTGGAC
GCGTTCGTGG TCGGCAACCG GATCCGCGAC AACGGACGCC GCTTCGCTCC CGCCGTCGTC
GGTTCCGGTG CTTCCGTGCG GTACGGGCGC AAGTCGGTGA CCGACGGGGC CGCGACCTGG
CCACCGGACG GGCACCGGGG CAAGGTGGTC GAGGTGGACG GCCGGAGGGC CGTGGTCGCC
TGCAACTCCG ACTCTGAACT CACCCTGGCC GAGGTACGGC CCGACGCCGT CACCGGCTGG
AGCGAGGACG TGCCGCCGCC GGGTAGCCAG TACCGGCTGC CGGATCCACC GGTGGAGCGG
GCCGGGATCA CGATCAACGC GGCGGTGGAC TCGACGACGG TGCGCGGCAA CCGGGTCTGG
GGTACTGGCG CCGCCACGCA GACCTACGGG CTGTGGATCA CCGAACACGG CAGCTGCGTG
GACGGCCGGG TCGAGGACAA CGACATCACC GGCAACGCCG AGGAGGCGAT CCGCCTGGAC
ACCCCACCGC TGGGCGGCAG GTGGGCCCGC AACTACACCG ACGAGGGCTG A
 
Protein sequence
MSHQGLSTAV PKRLTPVAGA PLWCVATEHG LRGDGVTNDQ PALAALVDLL GEGYAADGRA 
RVIYCPPGIY SIRDAGTVWR SGVSLIGAGP AATRFLLSNE GERSAPVPLA FWTTVQHGAD
RDRHIAEVTF ADFEIDGSGV AMTEYSYLAK GLGLQYVVRG VFRNLYIHHT GATGLGCDFL
QDTLIEGCVV VGCGRLDNGL QMGGAGIGIG VGGWGEVERC TIANCTTVGN GTNGIFLELQ
KSYWIPPRGY RIVGCHSQAN RFGISDWGAD GLVVTSCTLT SNLEAGFDVS ANGTASVAGR
GGILADCVID RNIGDGISMG NTPGPYTIRG NRISGNGGYG YHEHDLGNGF RGPSASVVIE
GNDLADNALD AIRIDRPMVD AFVVGNRIRD NGRRFAPAVV GSGASVRYGR KSVTDGAATW
PPDGHRGKVV EVDGRRAVVA CNSDSELTLA EVRPDAVTGW SEDVPPPGSQ YRLPDPPVER
AGITINAAVD STTVRGNRVW GTGAATQTYG LWITEHGSCV DGRVEDNDIT GNAEEAIRLD
TPPLGGRWAR NYTDEG