Gene Sare_1653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1653 
Symbol 
ID5703554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1903227 
End bp1904420 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content70% 
IMG OID641271159 
ProductXRE family transcriptional regulator 
Protein accessionYP_001536534 
Protein GI159037281 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0453749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0870716 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGG AAGCAGGCGA GGGTTTTCCC CCGTCTACAG ACACCCCCGC CGACCTCGGC 
GCTCACCTGC GAGCCGCCCG TGAGGCCGCC GGACACAGCC TCGCCGGTAT GGCCGCGCTC
ACCCACTTCA GCAAGCCCTA CCTCAGCCTT GTCGAGACGG GCCGCCGTCA GGCCACTCCC
GACATCGTTG AACGCTACGA ACATGCCCTC GGTGTACCGA TCGGCACCCC AGCCGACCCG
GTCCGCCGAA CCCACGAATG GCTCCTGGAC AGCCCACCCG CCACCGGCTG CCTACGCGCC
GGCCGCCGCA TCGGCGCAAA CCTGATCCGA ACGCTGGAAG CCCGGGTGAT CGACCTCCGC
CACCTGGACG ACACGGTCGG CAGCCGCACC CTGCTTCCCG TCATCCGCGC CGAACTCGAC
CACGCCGAAC ACCTCGCCCA CACCGCCTCC TACACCGACA CCTCCGGTAG ACGGCTGTAC
ACCGTGATCG GTGAACTGGC CCAACTCGCC GGCTGGGTCG CCAGCGACGC CGGCCACTAC
TCCGACGCCC AACGCCTATA CCTATCCGGC GTCACCGCCG CCGACGCAGC CTGCGACCGG
GCGCTGGGCG CGCAACTGCT GTCGAGCCTC GCCTACCAGA TCACCAACAT CGGCAAACGC
GACGACGCCC TGCTCATCGC CCGCTCCGCC GTCACCGGCG CCCCGCACGC CAGCCCGCTC
GTGCGGGCGC TGCTGCTGGA ACGCCTCGCC TGGGCCGCCG CCCGCCTCCG CGACACCGAT
ACCACCCGCC GCGCCCTTGA CGCCGTCAAC GACGCCTACG ACCAACACTG CGACGGTATC
GCCGAGCCCG AATGGGTGTA CTGGCTCAAC CGGACGGAGG TCGACGTCAT GGCCGCCCGC
TGCCTCATCG AACTCGGCAC CCCAGCCGCC GCCGAACCCC TGCTCACCCG AGCGCTCGCC
GGCTACAACC ACGACCACGC CCGCGAAGTC GCCCTCTACC AAACCTGGCT TGCCGAAGGC
CACGCCAAAA CCGGCAACCT CGACGCCGCC CGCGCCGTCC TGCACCGCAT CGACACCACC
GCCATCGACG CCGGCTCCAC CCGCCTGCAC CGCCGCATCA CCGCCGTCGA CCGCCTCATC
AACCGCCGCG CACAGAAGAA GCCCGCCAAC AGCACCAGAC GACCGACCGA GTAG
 
Protein sequence
MTMEAGEGFP PSTDTPADLG AHLRAAREAA GHSLAGMAAL THFSKPYLSL VETGRRQATP 
DIVERYEHAL GVPIGTPADP VRRTHEWLLD SPPATGCLRA GRRIGANLIR TLEARVIDLR
HLDDTVGSRT LLPVIRAELD HAEHLAHTAS YTDTSGRRLY TVIGELAQLA GWVASDAGHY
SDAQRLYLSG VTAADAACDR ALGAQLLSSL AYQITNIGKR DDALLIARSA VTGAPHASPL
VRALLLERLA WAAARLRDTD TTRRALDAVN DAYDQHCDGI AEPEWVYWLN RTEVDVMAAR
CLIELGTPAA AEPLLTRALA GYNHDHAREV ALYQTWLAEG HAKTGNLDAA RAVLHRIDTT
AIDAGSTRLH RRITAVDRLI NRRAQKKPAN STRRPTE