Gene Snas_4536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4536 
Symbol 
ID8885741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4839889 
End bp4841568 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content68% 
IMG OID 
ProductRNA polymerase sigma 70 subunit, RpoD subfamily 
Protein accessionYP_003513274 
Protein GI291301996 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.222967 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.785726 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGACG CGAGCCCCAA CGGTACTGAT GTCCGTTCAC TCACGGAATC GCTGCTCCAA 
CTGGCAGGCG AACGGGGTGG TCAACTCGCC TCCGCCGAGG TTGCCACATT CCTTGAGTCG
GCGCAGGTGG CCCCGGCGCA GGGCAAGAAG ATCCTGCGCG CCCTGGCCAA CGCCGACGTG
ACCGTCGTCG TGGACGACTC GGCCAAGCCG CGCCGTGCGG TTCCGGCGGC GCGCTCGGCG
ACCGCCGCCT CCAAGGCGAC CACGGCCAAG GCCGAGACCA AGCAGCCGGC GGCGAAGAAG
GCGGCCAAGA AGGCCACCAA GGCCGCGGCC AAGAAGACCA CCGCCAAGAA GAAGGCGACG
GACGAGGCCG CGACCGACGC CGAGCTCACC GACGTCGAGC AGGCTGACGG CGAGGCCAAG
ACCGCGGCCA AGAAGACCGC GGCGAAGAAG ACCGCCAAGA AGGCCACCAA GGCCGCCGTC
AAGAAGACGG CGAAGAAGGC GACCGCCAAG AAGGCCACCG ACGCCAACGG CGACGAGGAG
CCGACCGCCG CCGACCTGGC CGCCGCCGAG GGCGATGACG ACCTCGCCGC CGAGGCCGCC
GCCAAGAAGT CAACCCGCAA GTCCGCGAAG AAGACCGCCA AGAAGTCGTC CAAGAAGGAC
GGCAAGAAGG ACGAGGACGG CGAGGGCTCG GACGGCTTCG ACTGGGACGA CGAGGACTCC
GAGGCGCTGA AGCAGGCCCG CAAGGACGCC GAGATGACCG CCTCGGCGGA CTCGGTACGC
GCCTACCTCA AGCAGATCGG CAAGGTGCCG CTGCTGAACG CGGCCCAGGA AGTCGAACTG
GCCAAGCGCA TCGAGGCGGG TTTGTACGCC GTCGAGCGGC TGCGTCAGCT CAAGGAGGCC
GGCGAGGAGA TCCCGACCCA GGTCCGCCGG GACCTGGAAT GGGTCACCCG CGACGGCGAC
CGCGCCAAGA ACCACCTGCT GGCCGCGAAC CTGCGGCTGG TGGTGTCGCT GGCCAAGCGC
TACACCGGTC GCGGCATGGC GTTCCTGGAC CTGATTCAGG AGGGCAACCT CGGCCTGATC
CGCGCCGTGG AGAAGTTCGA CTACACCAAG GGCTTCAAGT TCTCCACCTA CGCCACCTGG
TGGATCCGCC AGGCCATCAC CCGCGCCATG GCCGACCAGG CCCGCACCAT CCGGATACCG
GTGCACATGG TCGAGGTCAT CAACAAGCTG GGCCGGATCC AGCGCGAGCT GCTGCAGGAC
CTCGGACGCG AACCCGCGCC GGAGGAACTG GCCAAGGAGA TGGACATCTC CCCGGAGAAG
GTCCTGGAGA TCCAGCAGTA CGCGCGCGAG CCGATCTCGC TGGACCAGAC CATCGGCGAC
GAGGGCGACA GCCAGCTGGG TGACTTCATC GAGGACTCCG AGGCGGTCGT GGCCGTCGAC
GCGGTGTCGT TCTCGTTGTT GCAGGGCCAG TTGCAGCAGG TGTTGCAGAC CCTGTCGGAA
CGTGAGGCGG GCGTGGTACG GCTGCGCTTC GGTCTCACCG ACGGTCAACC GCGCACTTTG
GACGAGATCG GGCAGGTCTA CGGAGTGACG CGGGAGCGGA TCCGGCAGAT CGAGTCCAAG
ACGATGTCGA AGCTGCGGCA CCCGTCCCGC TCCCAGGTTC TGCGCGACTA CCTCGACTAG
 
Protein sequence
MTDASPNGTD VRSLTESLLQ LAGERGGQLA SAEVATFLES AQVAPAQGKK ILRALANADV 
TVVVDDSAKP RRAVPAARSA TAASKATTAK AETKQPAAKK AAKKATKAAA KKTTAKKKAT
DEAATDAELT DVEQADGEAK TAAKKTAAKK TAKKATKAAV KKTAKKATAK KATDANGDEE
PTAADLAAAE GDDDLAAEAA AKKSTRKSAK KTAKKSSKKD GKKDEDGEGS DGFDWDDEDS
EALKQARKDA EMTASADSVR AYLKQIGKVP LLNAAQEVEL AKRIEAGLYA VERLRQLKEA
GEEIPTQVRR DLEWVTRDGD RAKNHLLAAN LRLVVSLAKR YTGRGMAFLD LIQEGNLGLI
RAVEKFDYTK GFKFSTYATW WIRQAITRAM ADQARTIRIP VHMVEVINKL GRIQRELLQD
LGREPAPEEL AKEMDISPEK VLEIQQYARE PISLDQTIGD EGDSQLGDFI EDSEAVVAVD
AVSFSLLQGQ LQQVLQTLSE REAGVVRLRF GLTDGQPRTL DEIGQVYGVT RERIRQIESK
TMSKLRHPSR SQVLRDYLD