Gene Sare_0594 details


Gene Information

Locus tag: Sare_0594
Symbol:
ID: 5704179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 674801
End bp: 676543
Gene Length: 1743 bp
Protein Length: 580 aa
Translation table: 11
GC content: 67%
IMG OID: 641270120
Product: hypothetical protein
Protein accession: YP_001535513
Protein GI: 159036260
COG category: [S] Function unknown
COG ID: [COG5293] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
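
The start, end, and length fields above are consistent with one another if the coordinates are read as 1-based and inclusive. That reading is an assumption, not something the record states. A minimal sketch of the arithmetic, with hypothetical helper names `gene_length` and `protein_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(674801, 676543)
print(length, protein_length(length))
```

With the values from this record the sketch gives 1743 bp and 580 aa, matching the listed gene and protein lengths.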


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.179265
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCTGCGGC AACTCAGCGC CTCTGACCCA CGCTTCAAAA CCATCGACCT CCACGACGGT 
CTCAACGTCC TCGTCGCCGA CACGACCAGC ACCTCCCGCG ACACCGACAG CCGCAACAGC
GCCGGCAAGT CCAGCATGAT CGAACTGCTG CACTTCCTCC TTGGCGGCGG CGGCGACAAG
AACTCACTGC CCGCGCACCG CCAGTTGCGC GACACCACGT TTGGGCTCCG GTTGGACTGG
CCAGGCATAC CCGACGGGCT GCGCGTGCAA CGAAGCGCAT CACGCACCAC GGCGATCAGC
TTGCGACCCA TGCCACCCGG TGCCGCCCCC GACCAGCTCG ACCTGGGAGC CGGTGAAATC
CAGCTACCGG AGTGGCAGCG GCTCATCGAA CGCGACCTGT TCCGGCTGCC GACCGAGCAC
CCCGGGGTCA GCGGGCGGGC GCTGCTGTCC TTCCTGATGC GCCGCATCAG CTCGCACGCC
TTTAACGGCC CTACCCGTAC CCACTCCCGG CACTCCGACG CTGAAGGAAC CACCAACCTG
GCCTACCTGC TCGGCCTGGA CTGGCAACTG GCCGACCGGT ACCGCACGCT CAAAGCCCGC
GAGACCACCA GCAAGCAACT GCGCAACGCA GCCAACGATC CGATCCTCGG GCGCATCGTC
GGTAAGGCGT CCGAGCTACG CGGTCAGATC GCCGTCGCCG AACACCGCGT CAACGAAGTT
GAACGGCAGG TCGCCGGGTT CCGCGTCGTG CCCGAGTACG AGAACCTGCG CCGCCGCGCC
GACGAGATGG ATCAACGGAT CCGGGCGCTG CGCAACCAGG ACGTCCTGGA CCGACGCAAC
CTGGCAGACC TCGAAAAGGC CGTCGAGGAA GCCGTCGACC CGGACATCGC CTACCTCGAA
TCCACCTACG CCGAACTCGG CGTCGTCCTC AGCGACCAGG TCCGCCGCAG CTTCGCCGAC
GTCGAAGCCT TCCACAACTC CGTCGTACGC AACCGACGCC GCTACCTCGG CGAAGAAGCC
GCTTTGATCC GCACCCGCCT CACCGAACGC GAAGCCGAAC GGGAACGCCT CGGCACCGAA
CTGGCCACCG TGCTGCGCGC CCTCAACGAA GGCGGAGCTC TCGACGCGCT CACCACACTC
CAGCAGGTCC TCGCCCAAGA ACAGGCCATG CTCGCCGCAC TGCGCCACAG GTACGAAGCG
GCGCAGGCCC TGGAAGCCAG CCGCCGCAGC ATCGCGACCG ACCGCATCGC ACTGCAACAG
GCCACCGAAG CAGACATCGA CGAACGCCGC CGAATCGTCG ACGAAGCGGT CGTGCTGTTC
GCCGAGTTCG CCCAGACCCT CTACGGCGAA GGCCGCGAAG CGTACCTGCG CTTCTCACCC
GGCGACAGCA GCCTGCGCAT CGACCCGCAC ATCGCCAGCG ACAACAGCGG CGGCATAGGC
AACATGGTCA TCTTCTGTTT CGACCTCACC GTCGCCGTCC TCGGCCACCG GGGACGCCGC
GCCCCGAACT TCCTCGTCCA TGACAGCCAC CTCTTCGACG GCGTCGACGA ACGACAGGTC
GCCCGAGCAC TCACCCTCGC TGCCGAGGTC GCCCGCCGCG AGCAGATGCA GTACATCGTC
ACGCTTAACT CCGACGAGCT GGACAAGGCC GTCCGCCGAG GCTTCGATCC GACCGGGCGC
GTGCTCGAAC AGCGCCTGAC CGACGCCTAC GAATCAGGCG GCCTATTCGG TTTCCGGTTC
TGA
 
Protein sequence
MLRQLSASDP RFKTIDLHDG LNVLVADTTS TSRDTDSRNS AGKSSMIELL HFLLGGGGDK 
NSLPAHRQLR DTTFGLRLDW PGIPDGLRVQ RSASRTTAIS LRPMPPGAAP DQLDLGAGEI
QLPEWQRLIE RDLFRLPTEH PGVSGRALLS FLMRRISSHA FNGPTRTHSR HSDAEGTTNL
AYLLGLDWQL ADRYRTLKAR ETTSKQLRNA ANDPILGRIV GKASELRGQI AVAEHRVNEV
ERQVAGFRVV PEYENLRRRA DEMDQRIRAL RNQDVLDRRN LADLEKAVEE AVDPDIAYLE
STYAELGVVL SDQVRRSFAD VEAFHNSVVR NRRRYLGEEA ALIRTRLTER EAERERLGTE
LATVLRALNE GGALDALTTL QQVLAQEQAM LAALRHRYEA AQALEASRRS IATDRIALQQ
ATEADIDERR RIVDEAVVLF AEFAQTLYGE GREAYLRFSP GDSSLRIDPH IASDNSGGIG
NMVIFCFDLT VAVLGHRGRR APNFLVHDSH LFDGVDERQV ARALTLAAEV ARREQMQYIV
TLNSDELDKA VRRGFDPTGR VLEQRLTDAY ESGGLFGFRF
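
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC percentage. The following is a minimal sketch, not the method the database used. The helper names `clean`, `gc_content`, and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listed above.
first_line = "ATGCTGCGGC AACTCAGCGC CTCTGACCCA CGCTTCAAAA CCATCGACCT CCACGACGGT"
print(translate(first_line))
```

For this first line the sketch prints MLRQLSASDPRFKTIDLHDG, which matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field.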