Gene Snas_5544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5544 
Symbol 
ID8886758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5890378 
End bp5891769 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content68% 
IMG OID 
Producttranscriptional regulator, AraC family 
Protein accessionYP_003514268 
Protein GI291302990 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.258636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.142482 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTTTGA CTCATCCACC GTCCGCACTC ATCGACGGCT CGCCCCGGCT GCGGCTACGG 
TCGCCGCCCA CGTACTGGCG CTGTGAACCG TCCTGGGTGT GGCAGGCGCG GCCGCTGACC
GACCATCTGC TGTGGCATGT GCACGACGGC GCCGGGGAGC TGCGGCTCGA CGACAGGCAC
GTGCGGCTGC GGCCGGGGTT CTGCGCGGTG TTCGCGCCCG GCGACGCGCC GGTGGCCACC
CACGATCCGC GACATCCGCT GCTGGTCTTC GGCATGCACT TCACCGCCAC GGGCTTCGAC
GGCGATCTGG TCCCCGAGCA ACGCTGGTGC CGGTTGTGGG ACCAGGAGTT CGCCGTCCGG
CTGGCGCGGC ACTGCGACTA CGCCCACCGA CGCGGCGACG ACCTCGGGCA GCGGCAGGCG
GTGCTCGGCC TGGAACAGTT CCTCTGTCTA TTGTGGGATA ACGTGACGCG TCCAGCTCCC
GGCCCCGGGG ACGCCGCCGT CGAGGACATC GCCCGGGCGG TCCGGCAGGA ACCCAGCCGC
GACTGGACCG TCGCGACGCT CGCCAAGCGG GCCAACCTTT CCCGTGCCCA GTTCACCCGC
CGCTTCACCG CCCACACCGG CTACTCCCCC GCGCGGTACG TGATCCGGGC CCGGCTCGAC
CGGGCCCGGC AGCTGTTGAC CGAGACCAAC ATGTCCGTCG GCCAGGTCGC CGCCACCCTG
GGGTACCCCG ACGTCGGTTA TTTCAGTCGC CAGTACAAAG TCCACACCGG TTCCTCACCC
AGTAGGGATC GCGGTGGCGG CGAAACTTCC GTAGATTGGT TAATGCGGGT ACCAACAGTG
CCGTTGTCCC CTAACATCCA GGGGCGATCA GACTATAGGA GCCGCAGCGT GATCACGGAA
TACCAAGCGC CCAACGGCTT CGCCACCATC CCCAGACAAC GGTCCCTTCG AGAACACCCC
CTGATCGCCG CGTTGTTGTC CCTCGAACTC GATCCCCGTG ACTACGTCAT CTTCGGCAGC
GGGCCGCTGC TGGCCCACGG GTTGCGCGCG GACGTCGCCG ACCTGGACGT CGTCGCCCGA
GGACGGGCCT GGCACCGGGC ACTGCGGATG TCCGGCTACT CCATCGATCT GGGACCGCAC
AGCTTCGAGC TCATGCTGCG GTTCTTCGGC GGCGGCGTCG AGATCTCCGC CTACTGGACC
GACACCAGCT GGGACGTCGA CGCGCTCATC GACACCGCCG AGATCATCGA CGGACTCCGG
TTCGCCGCGC TGCGGGATGT GCTGGAGTAC AAGCGAACCC TCGATCGCGC CAAGGACCGC
GCCGATGTCG CCGCCCTGGA GCGACACCTG TCCACGGCGG AAGAACTCGA ACCGGTGCTG
TGCGCGGCCT GA
 
Protein sequence
MFLTHPPSAL IDGSPRLRLR SPPTYWRCEP SWVWQARPLT DHLLWHVHDG AGELRLDDRH 
VRLRPGFCAV FAPGDAPVAT HDPRHPLLVF GMHFTATGFD GDLVPEQRWC RLWDQEFAVR
LARHCDYAHR RGDDLGQRQA VLGLEQFLCL LWDNVTRPAP GPGDAAVEDI ARAVRQEPSR
DWTVATLAKR ANLSRAQFTR RFTAHTGYSP ARYVIRARLD RARQLLTETN MSVGQVAATL
GYPDVGYFSR QYKVHTGSSP SRDRGGGETS VDWLMRVPTV PLSPNIQGRS DYRSRSVITE
YQAPNGFATI PRQRSLREHP LIAALLSLEL DPRDYVIFGS GPLLAHGLRA DVADLDVVAR
GRAWHRALRM SGYSIDLGPH SFELMLRFFG GGVEISAYWT DTSWDVDALI DTAEIIDGLR
FAALRDVLEY KRTLDRAKDR ADVAALERHL STAEELEPVL CAA