Gene Snas_4017 details


Gene Information

Locus tag: Snas_4017
Symbol:
ID: 8885218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4285641
End bp: 4287248
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: transcriptional regulator, SARP family
Protein accession: YP_003512762
Protein GI: 291301484
COG category:
COG ID:
TIGRFAM ID:
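
The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. The sketch below assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 4285641
end_bp = 4287248

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1608
print(protein_length)  # 535
```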


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00126088
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGAGT TCGAATTTGG TCTTTTAGGA GCCCTAACGG CGCGGGTGAA CGGCCATGAC 
GCACCCTTGG GAGGCCTCAA GCCCCGCCGG ATGCTCGCCA CCTTCCTGTT GATGCCCGGT
GAACAGCTGC CACTGGACCG GTTCATCGAC GTGGTGTGGG GTGCCCAGCC CCCCAAATCG
GCGAGCGCGA ACCTTTATTC CTATGTGACC GTCCTGCGGC GTGCGCTGCA CGGACGACTG
AACCGGCTGC GTAGCGGGTA TGTGCTGCAC GTCAAACCGG GCGAGCTCGA CGTCCAGGTC
TTCACCGACC TGCTCGTGGA GGCCCGCAGC GAGGCCGCCG CCGGCCACGT CGCCGATTCG
CTGGGTGCCT ATGACCGTGC GTTGAAATTG TGGCGTGGTG AGCCGCTGGC CGATATCAAA
GGGCCACCGC CATGGATTCC ATATATCCAA AAGTTGATAG ACACACGCCT TGACGCTCTT
GAGGAACGTG CCGCTCTTTA TGTTCATAAC GGACAGCAGA ATGAGGCGGT CGCGGAACTT
CGTGGTCTGA TCGCGGAACA TCCGCTCCGA GAAAGTCTAT GGCGGCAGTT GATGACCGCA
TTGGCCAGTG CCGGGCAGCG TGCCGAAGCC ATCGACACGT ACGGGCGATT GCGCTCGACA
CTCGCCGACG AACTGGGCAT CGAACCCAGC GAGGAGTCAC AGCAGGTACA TCGCAAACTA
CTCGGCGCAC CCACAGCCCG ATCCCGGCAT ACGGACGTGC GCACCGCCGA GTTGAAACGC
CGGTGCACCG ACATGGAGGC CATGGTCCGC GCGGCCGCGG CCACCCTGCC GGTCAGCGCG
ATGTCCATGA CCACCCCGGG ACAGACCGAC GCCGACATAC CGGCGCCGGA CAACCCGGGG
GCCTGGCTCG AGGACAACGT GGACGACATC CTGGCGCTGG TGCGTCAGGC CGCCACGGCG
GGTCTGGCGT CCTCGGCGTG GCGGCTGAGC GCGGCCATGC TGCCGTTCCT GGATTTCCGG
ATGCGGCTCG AGGACTGGCG GCGGTGCGTG CGGATCGCGC TGGTCGCGGC CCGCAAGTGC
GGTGACACCG AGGGCGAGGC CACGATGTTG CGCAGTCTCG GTCAGTGGCA CATCTACCAC
GACCACTTCG AGGCGGCCGA GGAGTGCTTC AACGTGGCCC GGGTGCTGAC CCGGGCGCTG
GGCAACGAAC GCGAGACCGC CCTGGCGGTC TACGGACTGG GCGCCGTGGC CCGGTTCACC
GGGCGCACGC CCGAGGCGGC CTCGTTGTTC CGCGACTCCG CCAACGCCCT GCACTCCATT
GGAGACGCCT ACGGTGAGAG CTACGCCCGC TGGGCGCTGG CCGGGGCCTT CATCGAGCTG
GGCAACATCG ACGGCGCCGA GGAGCAGCTC AACACCGCGC TGAAGTCGGC GCGTTCGGTC
GGCGACCGGC ACCGCGAAGG CCATGTGCTG GAGCGGTTCG CGGCGGTCTA CCGGGCCCGC
GGCAACGATT CCGAGGCGAT CTCCTGTCTG GAGGACGCGC TGGAGATTTT CACTGAACTG
GGCGACGTGC CCTGTACGAC CGGGGTCAAG GAGGCCCTGG CGGTGTGA
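
The gene sequence is printed in groups of 10 bases, 60 per line, so the spacing has to be stripped before the sequence can be measured. The following is a minimal sketch of how the sequence length and GC content could be recomputed from that layout; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-letter groups."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here as a small example input.
first_line = "ATGCCCGAGT TCGAATTTGG TCTTTTAGGA GCCCTAACGG CGCGGGTGAA CGGCCATGAC"
seq = clean_sequence(first_line)

print(len(seq))                        # 60
print(round(100 * gc_fraction(seq)))   # GC percentage of this one line only
```

Applying the same two functions to the full gene sequence should give a length matching the reported gene length and a GC percentage comparable to the reported GC content.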
 
Protein sequence
MPEFEFGLLG ALTARVNGHD APLGGLKPRR MLATFLLMPG EQLPLDRFID VVWGAQPPKS 
ASANLYSYVT VLRRALHGRL NRLRSGYVLH VKPGELDVQV FTDLLVEARS EAAAGHVADS
LGAYDRALKL WRGEPLADIK GPPPWIPYIQ KLIDTRLDAL EERAALYVHN GQQNEAVAEL
RGLIAEHPLR ESLWRQLMTA LASAGQRAEA IDTYGRLRST LADELGIEPS EESQQVHRKL
LGAPTARSRH TDVRTAELKR RCTDMEAMVR AAAATLPVSA MSMTTPGQTD ADIPAPDNPG
AWLEDNVDDI LALVRQAATA GLASSAWRLS AAMLPFLDFR MRLEDWRRCV RIALVAARKC
GDTEGEATML RSLGQWHIYH DHFEAAEECF NVARVLTRAL GNERETALAV YGLGAVARFT
GRTPEAASLF RDSANALHSI GDAYGESYAR WALAGAFIEL GNIDGAEEQL NTALKSARSV
GDRHREGHVL ERFAAVYRAR GNDSEAISCL EDALEIFTEL GDVPCTTGVK EALAV
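
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a hedged illustration, the sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code, and translates two short fragments taken from the start and end of the gene. It assumes the fragment is in frame and ignores the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Standard amino acid assignments, with codons ordered by T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCCCGAGT TCGAATTTGG TCTTTTAGGA"))  # MPEFEFGLLG
# Last 18 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GAGGCCCTGG CGGTGTGA"))               # EALAV
```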