Gene Snas_5201 details


Gene Information

Locus tag: Snas_5201
Symbol:
ID: 8886410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 5522329
End bp: 5523510
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: putative RNA polymerase sigma-24 subunit, ECF subfamily
Protein accession: YP_003513929
Protein GI: 291302651
COG category:
COG ID:
TIGRFAM ID:
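
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5522329
end_bp = 5523510

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1182 bp
print(protein_length, "aa")  # 393 aa
```

Under these assumptions the result agrees with the reported gene length of 1182 bp and protein length of 393 aa.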


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.603065
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.598241
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGTCCGCC GCTACGGCCA CTTCGACGAG AGCGAGGACG CCGTCCAGGA GGCGCTGCTG 
ACCGCCGCCA CCCGCTGGCC CACCGACGGC CTCCCCGACA ACCCGCGCGC CTGGCTCATC
ACCGTCGCCT CCCGCCGGCT CACCGACCAG CTGCGCAGCG ACGAGGCCCG CCGCCGCCGC
GAGGACACCG TCGCGGCCCG CCAGCTCCCC GAGGACACCC AGGCCCCCGC CGCCGACGCC
CCCGACACCA ACGCGGACGA CACCCTGATC CTGCTGTTCA TGTGCTGCCA CCCCTCGCTG
ACGGCCGCGT CCCAGATCGC CCTGACCCTG CGCGCGGTCG GCGGTCTGAC CACCGCCGAG
ATCGCCCACG CCTTCCTGGT CCCCGAAGCG ACCATGGCCC AACGCATCAG CCGCGCCAAA
CAACAGGTCA AGGCGTCCGG GCTGCCCTTC CAAATGCCAC CGGCACCCGA GCGGGCCGCG
AAACTGGGCG CCGTGCTGCA CGTCCTCTAC CTGATCTTCA ACGAGGGCTA CACCGCCACC
TCCGGCCCCA ACCTGCGACG CGCGGAACTG TCGAACGAGG CCATCCGCCT CACCCGAGCC
GTCCACCGAC TGCTGCCCGA CGACGGCGAG GTCACCGGCC TGCTGGCCCT GATGCTGCTG
ACCGACGCCC GCCGCGACGC CCGCAACACC GCCACCGGCG ACCTCGTCCC ACTCGCCGAC
CAGGACCGCT CCCGCTGGGA CCGGCGATCC ATCGCCGAAG GCGTCGACCT CATCAGCCAC
GCGCTGGCCA CCGCGCCCCC TGGCCCCTAC CAGGTCCAGG CCGCGATCGC CGCCATCCAC
GACGAGGCCC CCAGCACCGA AGCCACCGAC TGGCCCCAGA TCGTCGCGCT GTACGCCGTC
CTGGACAACC TGGCCCCCGG CCCCATGGTC ACCCTCAACC AGGCCGTCGC CGTGGCCATG
GTGGACGGAC CCCGGGCCGG GCTGGAACTG TTGTCCCGCC TCGACGACGA CCCCCGCATG
GCCCGGCACC ACCGCCTGGA GGCCGTCCGC GCGCATCTCT ACGAAATGGA CGGTGACCCC
GCCGCCGCCC GCGCCGCCTA CCTCGCCGCC GCCACCCTCA CCACGAGCCT CCCCGAACAG
GACTACCTGC GCTGGCGGGC CGACAAGCTG CCCGAATCGT GA
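
The GC content field of the record can in principle be recomputed from the gene sequence above. The sketch below shows one way to do this, assuming GC content means the fraction of G and C bases over all bases; `gc_content` is a hypothetical helper name, not part of the source database. It strips the spaces and line breaks used in the 10-base block layout before counting.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First row of the gene sequence above, used only as a short example input.
first_row = "TTGGTCCGCC GCTACGGCCA CTTCGACGAG AGCGAGGACG CCGTCCAGGA GGCGCTGCTG"
print(f"GC content of first row: {gc_content(first_row):.1%}")
```

Passing the full gene sequence instead of `first_row` gives a value that can be compared with the reported GC content.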
 
Protein sequence
MVRRYGHFDE SEDAVQEALL TAATRWPTDG LPDNPRAWLI TVASRRLTDQ LRSDEARRRR 
EDTVAARQLP EDTQAPAADA PDTNADDTLI LLFMCCHPSL TAASQIALTL RAVGGLTTAE
IAHAFLVPEA TMAQRISRAK QQVKASGLPF QMPPAPERAA KLGAVLHVLY LIFNEGYTAT
SGPNLRRAEL SNEAIRLTRA VHRLLPDDGE VTGLLALMLL TDARRDARNT ATGDLVPLAD
QDRSRWDRRS IAEGVDLISH ALATAPPGPY QVQAAIAAIH DEAPSTEATD WPQIVALYAV
LDNLAPGPMV TLNQAVAVAM VDGPRAGLEL LSRLDDDPRM ARHHRLEAVR AHLYEMDGDP
AAARAAYLAA ATLTTSLPEQ DYLRWRADKL PES
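
The protein sequence begins with M although the gene sequence begins with TTG, which would otherwise encode leucine. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is one of several permitted start codons. The sketch below illustrates the idea, assuming the standard codon assignments and the table 11 start codon set; `translate_cds` is a hypothetical helper written for this example, not a function of the source database.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 uses
# the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(bases), 3):
        codon = bases[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGGTCCGCC GCTACGGCCA CTTCGACGAG"))  # MVRRYGHFDE
```

The output matches the first ten residues of the protein sequence listed above.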