Gene Information |
Locus tag | Snas_4017 |
Symbol | |
ID | 8885218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 4285641 |
End bp | 4287248 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003512762 |
Protein GI | 291301484 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00126088 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAGT TCGAATTTGG TCTTTTAGGA GCCCTAACGG CGCGGGTGAA CGGCCATGAC GCACCCTTGG GAGGCCTCAA GCCCCGCCGG ATGCTCGCCA CCTTCCTGTT GATGCCCGGT GAACAGCTGC CACTGGACCG GTTCATCGAC GTGGTGTGGG GTGCCCAGCC CCCCAAATCG GCGAGCGCGA ACCTTTATTC CTATGTGACC GTCCTGCGGC GTGCGCTGCA CGGACGACTG AACCGGCTGC GTAGCGGGTA TGTGCTGCAC GTCAAACCGG GCGAGCTCGA CGTCCAGGTC TTCACCGACC TGCTCGTGGA GGCCCGCAGC GAGGCCGCCG CCGGCCACGT CGCCGATTCG CTGGGTGCCT ATGACCGTGC GTTGAAATTG TGGCGTGGTG AGCCGCTGGC CGATATCAAA GGGCCACCGC CATGGATTCC ATATATCCAA AAGTTGATAG ACACACGCCT TGACGCTCTT GAGGAACGTG CCGCTCTTTA TGTTCATAAC GGACAGCAGA ATGAGGCGGT CGCGGAACTT CGTGGTCTGA TCGCGGAACA TCCGCTCCGA GAAAGTCTAT GGCGGCAGTT GATGACCGCA TTGGCCAGTG CCGGGCAGCG TGCCGAAGCC ATCGACACGT ACGGGCGATT GCGCTCGACA CTCGCCGACG AACTGGGCAT CGAACCCAGC GAGGAGTCAC AGCAGGTACA TCGCAAACTA CTCGGCGCAC CCACAGCCCG ATCCCGGCAT ACGGACGTGC GCACCGCCGA GTTGAAACGC CGGTGCACCG ACATGGAGGC CATGGTCCGC GCGGCCGCGG CCACCCTGCC GGTCAGCGCG ATGTCCATGA CCACCCCGGG ACAGACCGAC GCCGACATAC CGGCGCCGGA CAACCCGGGG GCCTGGCTCG AGGACAACGT GGACGACATC CTGGCGCTGG TGCGTCAGGC CGCCACGGCG GGTCTGGCGT CCTCGGCGTG GCGGCTGAGC GCGGCCATGC TGCCGTTCCT GGATTTCCGG ATGCGGCTCG AGGACTGGCG GCGGTGCGTG CGGATCGCGC TGGTCGCGGC CCGCAAGTGC GGTGACACCG AGGGCGAGGC CACGATGTTG CGCAGTCTCG GTCAGTGGCA CATCTACCAC GACCACTTCG AGGCGGCCGA GGAGTGCTTC AACGTGGCCC GGGTGCTGAC CCGGGCGCTG GGCAACGAAC GCGAGACCGC CCTGGCGGTC TACGGACTGG GCGCCGTGGC CCGGTTCACC GGGCGCACGC CCGAGGCGGC CTCGTTGTTC CGCGACTCCG CCAACGCCCT GCACTCCATT GGAGACGCCT ACGGTGAGAG CTACGCCCGC TGGGCGCTGG CCGGGGCCTT CATCGAGCTG GGCAACATCG ACGGCGCCGA GGAGCAGCTC AACACCGCGC TGAAGTCGGC GCGTTCGGTC GGCGACCGGC ACCGCGAAGG CCATGTGCTG GAGCGGTTCG CGGCGGTCTA CCGGGCCCGC GGCAACGATT CCGAGGCGAT CTCCTGTCTG GAGGACGCGC TGGAGATTTT CACTGAACTG GGCGACGTGC CCTGTACGAC CGGGGTCAAG GAGGCCCTGG CGGTGTGA
|
Protein sequence | MPEFEFGLLG ALTARVNGHD APLGGLKPRR MLATFLLMPG EQLPLDRFID VVWGAQPPKS ASANLYSYVT VLRRALHGRL NRLRSGYVLH VKPGELDVQV FTDLLVEARS EAAAGHVADS LGAYDRALKL WRGEPLADIK GPPPWIPYIQ KLIDTRLDAL EERAALYVHN GQQNEAVAEL RGLIAEHPLR ESLWRQLMTA LASAGQRAEA IDTYGRLRST LADELGIEPS EESQQVHRKL LGAPTARSRH TDVRTAELKR RCTDMEAMVR AAAATLPVSA MSMTTPGQTD ADIPAPDNPG AWLEDNVDDI LALVRQAATA GLASSAWRLS AAMLPFLDFR MRLEDWRRCV RIALVAARKC GDTEGEATML RSLGQWHIYH DHFEAAEECF NVARVLTRAL GNERETALAV YGLGAVARFT GRTPEAASLF RDSANALHSI GDAYGESYAR WALAGAFIEL GNIDGAEEQL NTALKSARSV GDRHREGHVL ERFAAVYRAR GNDSEAISCL EDALEIFTEL GDVPCTTGVK EALAV
|
| |
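
The length fields in the record above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon (the record lists "Is gene spliced: No", and the gene sequence ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len: int) -> int:
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces that split the displayed sequence into blocks of 10."""
    return "".join(seq.split())


# Values copied from the Gene Information table for Snas_4017.
start_bp, end_bp = 4285641, 4287248

length = gene_length(start_bp, end_bp)
print(length)                           # 1608, matches "Gene Length"
print(expected_protein_length(length))  # 535, matches "Protein Length"
```

`strip_blocks` is included so that the Gene sequence or Protein sequence field can be pasted in as displayed and its length compared with the values above using `len(strip_blocks(...))`.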