Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_0797 |
Symbol | |
ID | 8881981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 843010 |
End bp | 846015 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003509602 |
Protein GI | 291298324 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTTC GGATTCTCGG TGCGCCCGAG GTGATACGCG ACGGTGCGCC GCTCGCTATC CGCGGCCGCA TCGCGCCGAG GCTGCTGGCC GTCCTGGCCC TGGAGGCCGG ACACGTGGTA CCGATATCCA CTTTGGTGGA AGCCCTGTGG GAGGACCCGC CGGTCACCGC CCGGCGGCAG GTGCAGAACA CCGTCTCGGC GCTGCGCGCC GTCCTGGGCG AGCGGACCGT CGAGGCCGTG GCCGAGGGCT ACCGGCTCGC GGTTCCCGCC GAGACCGTCG ACGCCGGACG ATTCGCCGCC GGGGTGCGGC GGGCCCGGCA GCTGCGGGAG GACGGCGACC CCGTCGCGGC CCTGGAACGA CTGCGCGAGG CACTGGCGCT GTGGCGCGGA CACGCCCTGG CGGGCATGGG CGGGCAGGTG CTGGAACGCG GCGCCCGGCG GCTCGGCGAG GACCGGCTGG CCGCCTTCGA GGAACGCGTC GAACTCGAAC TCGAACTGGG ACAACGGGTC CCGGTCGGTG AACTGCGGCA GCTGCTCACC GAGAACCCGT ACCGGCAGCG GCTGGCCGCG CTGTTGATGC TGGCGCTGTA CCGCGAGGGC CGGGCGCCCG AGGCCCTGGA GGTCCACACC CGGATGCGGC AGCGGCTCAG CCGCGACCTG GGCGTCGAAC CCGGTCCGGC GCTGCGGGAA CGCTACGCCG CGATCCTGCG CGAGGACCCG GAACTGGACG TCACCGGCAC CCCGGCACCG GCGTCCCGCG GCGCCACCCG GCCGGAGTTC GCCCCGGCGC AGCTCCCGGC GGCGCTGGCC GGGTTCACCG GCCGCACCTC ACAGCTGCGG GCCCTGGACG CGCTCGGCGA CGACTCGGTC CTGGCGACCA TCACCGGTTG CGGCGGCAGC GGCAAGACCG CGCTGGCCGT CCACTGGGCC CACCGCAACC GCGACCGGTT CCCCGACGGC CAGCTGTACC TCAACCTGCG CGGGTTCGAC GCCGACGCCC CGCTGTCGCC GCAGGACGCG CTGACCCGGC TGCTGCCCGC GCTGGGGCAA CCCGCCGACG CCGTCCCCGC CGAGCTGGAC GCCGCCGCCG CGCTGTTCCG GTCGCTGTTG ACCGGGCGCC GGATGCTGCT GCTGTTGGAC AACGCCCGCG ACGCGGCCCA GGTGGAACCG TTGCTGCCCA ACGAACCCGG GACCGTGACC CTGGTGACCA GCCGCCATCG GCTCACCGAG CTGGCCGCCC ACGGCGCCAC CGCCATCTCG CTGGACACGC TCGACGAGAC CGACGCGCTG GCCCTGTTGT CCACACTGGT CCCCGACGAC CGGCTGGCCG AGGACCCGGC GGCCACCGCC GAACTCGTCT CCCGGTGCGG CGGTCTGCCG CTGGCGCTGC GCATCGTCGG CGCCAACCTC GCCGGTCGTC CCTATTCGAC GGTCGCGGAG TTCGCGGCCG AACACTCCGG TTCCGACCGG CTCGGCCTGC TCACCGTCGA CGGCGATCCC AACGCCACCG TCGCCACCGT CTTCGAACGC TCCGCGCGCG CCCTCGACGA GGACACCCGG CGGCTGTTCC TGCGGCTGGG CCTCATCCCC GGCGACGAGA TCCCCGAGGA CCTGTCGCGC GGCACCGCGG ACCTGTCCGA GACCGTCAAC CGGGAACTGT TGGGACGGCT GGAAAGCGCC CACCTCATCG AACGGCATCG CCCCGGGCTG TACCGGTTCC ACGACCTGGT GCGGCTGTAC GCCCGCCAGC AGGCCGAATC CACACTCGAC CCGGCCGAGG CGGCCGAGGC CCGCACCGCC GCGATCGACT GGTACGCCAG CTCCTTCGAG CAGGTCAGCG CCGACCCCGG CAACGTCATC GCGGCGTTCA AGGCCTGGCA GGACCACCCC GACGCCTGGC GGCTGGCCCG GATGCTGCCG GACTTCCTGC GGTACGGGCA CAGCCTGGCG GGCCTGCGTC CGCACGTCGA GACCGGACTG GCGCTCGCCA AGGACTCCGG GGACGCCGGG GCGCTGAGCC AGATGTACGA CGCGATGGCC TTCGTCCACT CCAACTCCGG CGACCACCTC ACCGCGCTCA CCTGTGGACG GAAAGCGGTC GCGATCGCCC GGGGGCTTCC CGACGGCGAC GCCGACGGCC GGTTGCGCAG CGGGCTGGCG GCGCTGCTGC TGTTCAACTC CTACTACGAG GAGTCGGCGG CCCTGTCCCG CGAGACCCTC GCCATCGCCG AGGCCGACGG CGACATCGGC CGGATCTTCC ACGACCGCAT CTTCCTGGGC GCCGCCTATC GCAGCAGCGG CCGGTTCGCC GATGCCGAGA CCTGCTTCGT TCAGGCGATG CGGCTCGCCG ACACCTGCGA CGACGACGAT CCCCGCCGGG TCACCGCCCG GTTCAAGCTC GCCCGGCTGT ACAACGACAT GGATCGCGTC GAGGCGGCCT GGCCGCTGCT GGAGGCGATC GGGCAATGGT GGCAGCGCAC CGGTTCCGAC TTCATCCGCG AACGGGTGCT GTGGCTTCGG GGCCAGTTGC AGCTGCGTTC CGGCCGCGTC GAGGCGGCCG AGCAGGACCT CGCCGACAGC TGGGAGCTGT CGCACCGCAA CGGCATCCTG GTGTGGGCCT GGTACGCGCG GTTCGCGCAG ATCGAACTGT GCTGCCAGCT CGGCGACCAC GAACGCGGAC TGGCCTACGT CGAGGAGCTC GCCGAGTCCG GCGCCGACAC CTTCGAACGC GACGACCGGG CCCAGTGGGC GGCCCTCAGC GCCAAGGTCC ACATCGGCCT CGGTGACTAC CGGGCGGCGA TCGCGGCGGC CGAACAGGCC CGCACCGTCT TCACCGTCAC CAAGAACCCG CTGCGGCTGG CCCGCTGTCT GGTCACCCTC GCCGAGGCCC ACGAACGACT GGGCGACACC GAGACCGCCG AACGGCACCG CGACGAGGCC CGCCGCGGCT TCGCCGCGCT GGGTCTGTCC GAAAAGGCCA CCAGGATCGG CATGTCGACC GGCTAG
|
Protein sequence | MEFRILGAPE VIRDGAPLAI RGRIAPRLLA VLALEAGHVV PISTLVEALW EDPPVTARRQ VQNTVSALRA VLGERTVEAV AEGYRLAVPA ETVDAGRFAA GVRRARQLRE DGDPVAALER LREALALWRG HALAGMGGQV LERGARRLGE DRLAAFEERV ELELELGQRV PVGELRQLLT ENPYRQRLAA LLMLALYREG RAPEALEVHT RMRQRLSRDL GVEPGPALRE RYAAILREDP ELDVTGTPAP ASRGATRPEF APAQLPAALA GFTGRTSQLR ALDALGDDSV LATITGCGGS GKTALAVHWA HRNRDRFPDG QLYLNLRGFD ADAPLSPQDA LTRLLPALGQ PADAVPAELD AAAALFRSLL TGRRMLLLLD NARDAAQVEP LLPNEPGTVT LVTSRHRLTE LAAHGATAIS LDTLDETDAL ALLSTLVPDD RLAEDPAATA ELVSRCGGLP LALRIVGANL AGRPYSTVAE FAAEHSGSDR LGLLTVDGDP NATVATVFER SARALDEDTR RLFLRLGLIP GDEIPEDLSR GTADLSETVN RELLGRLESA HLIERHRPGL YRFHDLVRLY ARQQAESTLD PAEAAEARTA AIDWYASSFE QVSADPGNVI AAFKAWQDHP DAWRLARMLP DFLRYGHSLA GLRPHVETGL ALAKDSGDAG ALSQMYDAMA FVHSNSGDHL TALTCGRKAV AIARGLPDGD ADGRLRSGLA ALLLFNSYYE ESAALSRETL AIAEADGDIG RIFHDRIFLG AAYRSSGRFA DAETCFVQAM RLADTCDDDD PRRVTARFKL ARLYNDMDRV EAAWPLLEAI GQWWQRTGSD FIRERVLWLR GQLQLRSGRV EAAEQDLADS WELSHRNGIL VWAWYARFAQ IELCCQLGDH ERGLAYVEEL AESGADTFER DDRAQWAALS AKVHIGLGDY RAAIAAAEQA RTVFTVTKNP LRLARCLVTL AEAHERLGDT ETAERHRDEA RRGFAALGLS EKATRIGMST G
|
| |