Gene Sare_3018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3018 
Symbol 
ID5707359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3431337 
End bp3433148 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content62% 
IMG OID641272465 
Producthypothetical protein 
Protein accessionYP_001537833 
Protein GI159038580 
COG category[S] Function unknown 
COG ID[COG4529] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0249937 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTGCC TGGTGGGGGC TGGTCCCAGG GGACTCGCCG TCCTGGAGCG ATTGTGCGCC 
AATCATGCTG GCGAGAACGA GCTGGTGATC CACGTAGTGG ATCCCTTTCC GCCGGGGTCA
GGGAGGATCT GGCGGGAGCA GCAGTCACCA CAACTGCTGA TGAATACGGT GTCGTCCCAG
GTCAGCCAGT TCACCGATGA GAGTATCGAC TGCTCGGGTC CGATCAGGCC GGGCCCGAGT
TTGCACGGTT GGCTGCAAAG CCACGACTCT GATGCTGACC AGGGGCGACG GGGCCCTGAC
GACTATCCCT CTCGGAGATT GTACGGCCGC TATCTGAGAT GGGTGTTTGA TCGCGTGGTA
GCCGATGCGC CTGACACCGT CCGCGTCGTT GTCCATCGGG CCACAGCGGT GGCGTTGGAG
GACGCGGACA ACGGCCAGTG CGTCACCCTG GATGACGGAA ATCGACTGGC GGGGATGGAC
GCCGTGGTTC TGTCACTCGG ACACAGCGAT ACGGAGCTGA CCGGCAAGGA ACGCCAGTTG
GCGGGTTTCG CCGAACGTCA CGGCCTGCGT TACTTTCCTC CCGCAAACCC AGCCGATCTC
GATTTCGATA AGATCGATGC CGGGGAGGCA GTCGGTGTCC GGGGACTCGG GCTCGCATTC
TTTGACGTCC TGGCACTGGT TATGGAGGGT CGAGGCGGTC GGTTCGTGCC TTCCGACAAG
GGTCTACGGT ATCGACCGTC CGGTCGCGAA CCAACTCTGT ACGCCGGAGC CCATCACGGG
ATTCCCGACT ACGCGAGGGG GAGAAATCAG AAGGGTGTCG CCGGCAAGCA TCGTCCCCGG
TTCTTGACTG CCGACGCCGC CCACCGTATT CGCGAAAACC CGGAGGCGAC GTTCCGGCGG
GACGTGTGGC CCTTACTGGA TGCCGAAGTT CGTACGGTCT ATTATCAGGC ACTCGTCGCA
CAGCGCGGCG GTTCCTCCGC CGCCAGCAAC TTCCTGAAAG ATTACCTTTC TGAACCCGAC
GATCCAGGAA CACTTCGACG GCACGGCCTG ACCGCATCGG ACGAATGGAG TTGGGAGAGG
CTCGGCCAGC CGTGGCGGCC CCACGAATTC TCTGATCATG CGACGTTCAA TCAATGGCTA
CTCGGTCACC TCCGCGAGGA CATCGGCCAT GCTGAGGTCG GTAACGTCGA CGACCCGGTC
AAAGCCGCCC TGGACGTCAT ACGAGACCTG CATAAAGAGA TTCGACTGGC AATTGACCGC
TCGGGTGTCA TCGGATCGTC CTATCGCGAC GAAGTGATCC ACTGGTTCAC GCCACTGAGT
ACTCTTTTTT CTGCTGGTCC GCCGCCCCTG CGGGTCGAAC AGATGGCGGC ACTTATCGAG
TGCGGTCTGC TGCAGGTCGT CGGCCCGGAG CCCCAGGTGC GAACGGACCC TACCGGCGCC
TGCTTCCTCA TCGGCTCCGC CACGATACCT GGAAAACAAA TACGGACAAC ATCATTGATT
GAGGCGCGTA TCCCGAAACC AGACCTGAAG CACAGCGCCA ATCCGCTGCT GTGCTTCCTG
GTCGAAACAG GGCAGTGTCG TCCATATCAT ATTCCGGACC CCGACGGTGC CTACGAAAGC
GGCGGGCTTG ACGTCACCCC ACGGCCATAT CGCCTGATAG ACGCCGCCGG CGTCCCCCAT
TCGCGCCGTT TTGCGTACGG ACCACCGACC GAGTCGGTTT TCTGGTTCCT GAACGAAACG
ATTCGTCCCG GCATCGGCTC CATGATTCTC GAGGACGCGG ATGCCATCTC TCGAGCGGCG
CTCACGTGCT AG
 
Protein sequence
MICLVGAGPR GLAVLERLCA NHAGENELVI HVVDPFPPGS GRIWREQQSP QLLMNTVSSQ 
VSQFTDESID CSGPIRPGPS LHGWLQSHDS DADQGRRGPD DYPSRRLYGR YLRWVFDRVV
ADAPDTVRVV VHRATAVALE DADNGQCVTL DDGNRLAGMD AVVLSLGHSD TELTGKERQL
AGFAERHGLR YFPPANPADL DFDKIDAGEA VGVRGLGLAF FDVLALVMEG RGGRFVPSDK
GLRYRPSGRE PTLYAGAHHG IPDYARGRNQ KGVAGKHRPR FLTADAAHRI RENPEATFRR
DVWPLLDAEV RTVYYQALVA QRGGSSAASN FLKDYLSEPD DPGTLRRHGL TASDEWSWER
LGQPWRPHEF SDHATFNQWL LGHLREDIGH AEVGNVDDPV KAALDVIRDL HKEIRLAIDR
SGVIGSSYRD EVIHWFTPLS TLFSAGPPPL RVEQMAALIE CGLLQVVGPE PQVRTDPTGA
CFLIGSATIP GKQIRTTSLI EARIPKPDLK HSANPLLCFL VETGQCRPYH IPDPDGAYES
GGLDVTPRPY RLIDAAGVPH SRRFAYGPPT ESVFWFLNET IRPGIGSMIL EDADAISRAA
LTC