Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_4482 |
Symbol | |
ID | 8885687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 4780275 |
End bp | 4783100 |
Gene Length | 2826 bp |
Protein Length | 941 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003513220 |
Protein GI | 291301942 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.434883 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.14218 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGCAAT TCCGGGTCCT GGGCCCGATG GAAGTCTGCT GTGATGGGCA AGTGGTGCCG ACCGGCGGTG GCCGCAGTAA GGCCGTGCTG GCGGCACTTC TTTTGCACGC CAACAAAACC GTCTCCCGGC CCCGGCTGAT CGAGCTGGTG TGGGCTGACG CCCCCGACTC CGTCGACTCC AACCTGCGCA GCTACCTGGC GAAACTGCGC AAGAAGCTGC ACATCCCCGG CGAGGACGCC TCCCGGCTGG TCGCCGACCC GCACGGCTTC CGGGTCGTGG TCGACGAGGG TGAGCTCGAC CTGGCCGAGT TCGACGCGCT GGCCAGCCGG GGCGAACAGG CCCTGGCCGC CGAGGATCCC GCCACCGCCG CCGACTGCTT CACCCGGGCG CTGGCCCTGT GGCGGGGACC GGCGTTGGGC GGCGTCCTGG TGGAACCCAA CCTGGCCGCG GCTGTCGCGC ACGTCGAGGA ACGCCGCGAA CACATCGAAC TGCGCAACTT CGAGGCCCGG CTGGCCCTGG GACAGCACGG CGAACTCATC GGCGAACTGC GCGGACTGAT CTCGGCGAGG CCGCTGCGCG AGAAACCCGT CGCCCTGCTC ATGCTGGCGC TGTACCGCTC GGGTCGCCAG AAGGAAGCCC TGGAGTTGTA CCGCCAGACC CGGGCCCGGT TCCGCGACGA ACTGGGCTGC GACCCGGGCA AGAACCTGAC CGAACTGCAC CGCCGCATCC TCAGCGCCGA CGCCGGACTG GACGCGCCCC GGCCCGCCGT CGCCGTGGCC GCCGCCCCCG CCCGGGTGGT GCCGTCGCAG CTGCCCGCCG ACGTCCCCGC CTTCACCGGC CGCCAACCGG AACTGGCCGC GCTGCGGCAG CTGTCCCGCG GCTGCCTGGA CGCCGAGGCC GCCGGATCGG TGGTGGTGTG CGCCCTGGAC GGCATGCCCG GCATCGGAAA GACGACGCTG GCCGTGCACG CCGCCCGGCT GCTGGCCCCC GACTTCCCCG ACGGCCAGCT GTTCTTGGAC CTGCACGGTT TCAGCGGCGC CGGGGAACGC GTCGAACCCG GCGACGCCCT CGACCGGATG CTGCGGGCGC TGGGCCTGGC CGTCGAGGAC ATCCCCGCCG AGGTGGAGGA CCGCGCGGCG CTGTACCGCT CGCTGCTGTC CGACCGGCGG CTGCTCATCG TGCTCGACAA CGCCGCCGAC GAGGCCCAGC TGCGGCCGCT GCTGCCCGGC GGCTCCCGCT GCCTGGTCAT CGTCACCAGT CGCCGCCGGA TGTCGGCCCT GGACGAGGTC ACCCCGATCC CGCTGGACGT CATGAGCGCC GACGAGGCCA CGGCGCTGTT CCTGGACGCC GCCGCGACCA CCGGACTCGA CGACACCGCG GCGGGCATCG TCGCCGAGGT CGTCGAGACC TGCGGACGGT TGCCGCTGGC GGTGCGGATC GCCGCCGCCC GGCTGCGCAG CCGTCCCAAC TGGACCCTGG CCGACCTGCG CGACCGGCTG GCCCGCGACG GCGAACTGCT GTCCAAACTC GAGTTCGGGC AGCGCAGCGT CCGGGTCGCC TTCGAGATGT CCTACCGGGA ACTGGACGAA CGGCACGCCC ACCTGTTCCG GCTGCTGGGC CTGGCGCCCG GCTCCGACCT GTCGGTGGGC GCGGTGTGCG CGCTGTTGGG CGAGGCGGCC GCCCACGAGG TCGAGACCCG GCTGGAGGAC CTGGTCGACG CCCACCTGCT GCGGTCCCGG CGGCCCGGAC GCTACGCGTT CCACGACCTG CTGCGCGCCT ACGCCGTCAA CCTCACCGAG TCCACCGACC CCGCCGAGCT GCGCGCCGAG GCGGTGGAGC GCATCGTCGA CTACCACGTC CACACCGCGC ACCGGGCGAC GCTGCGGCTG GACCCGCTGC GGCGCGTCAC CCCGCTGCCG CCGCTGCACG ACGCCGTCAA ACCCGCCGAG TTCGTCGACA GCCGCGCCGC CCGGGACTGG TTCGACGAGG AACTGGCGAC ACTGAAGGCG GTGCTGCGAC TGACCGAGAC GCGGCGACTG GACGAGTCGA CCTACCAGCT GGTGTGGGCG ATGGAGACCT ACCTGAGCCG CCGGGGCCTG TGGGCCGACT GGGAACGGTT CCAGCGGCAG GCCATCGCCG CGGCCGAGCG ACTGGGCGAC CTGCCCCGGC AGGCCTACGC GCACCGGGTG CTGGGCCGTC CGCTGACCCA GCTGAGCCGC TACCCCGAGG CCTGCGCCGA GTTCAGCAAG GCCGTGGCGC TGTTCAAGCA GCTCGACGAC CCGGTCGGGC AGGCCGACTC GCACCGGGGC CTGGCCTGGG CGTACTCCCG CACCGGCGAC CGGCAGCGCG GCCTGGAGAA CGTCATGGAG AGCCTGGTCC TGTACCGCAA GGCCGGGAAC CAGCGCCGGG TCGCGCACGC CCTCAACGGC GTGGGCTGGC AGCACGCCCT GGCGGGGGAG TACCCGCAAA CCCTCGACTA CTGTGGACAA GCGATCGAGA TCTTCGCCGA ACTCACCAAA CAGCACGGTC ACCACGACGC CCACGCCGAG GCCGAGACCT GGGACAGCCT CGGCTACGCC CACCACCATC TGGGCAACCA CACCGAGGCC ATCACCTGCT ACCGCACCGC CTTGGACCTG CTGGAGGACG TCGAGGACCG GTACTTCACC ACCGAGGTGC TCACCAACCT CGGCGACGTG TACCTGTCCG AGGACGACCG CGACGCCGCC CGCCGGGTGT GGAGCCAAGC GCTGACCACA CTGGACGAGC TCGGGCATCC GTACGCCGAC AAGGTTCGCA GCCGACTGGC GACACTGGAC GAATGA
|
Protein sequence | MLQFRVLGPM EVCCDGQVVP TGGGRSKAVL AALLLHANKT VSRPRLIELV WADAPDSVDS NLRSYLAKLR KKLHIPGEDA SRLVADPHGF RVVVDEGELD LAEFDALASR GEQALAAEDP ATAADCFTRA LALWRGPALG GVLVEPNLAA AVAHVEERRE HIELRNFEAR LALGQHGELI GELRGLISAR PLREKPVALL MLALYRSGRQ KEALELYRQT RARFRDELGC DPGKNLTELH RRILSADAGL DAPRPAVAVA AAPARVVPSQ LPADVPAFTG RQPELAALRQ LSRGCLDAEA AGSVVVCALD GMPGIGKTTL AVHAARLLAP DFPDGQLFLD LHGFSGAGER VEPGDALDRM LRALGLAVED IPAEVEDRAA LYRSLLSDRR LLIVLDNAAD EAQLRPLLPG GSRCLVIVTS RRRMSALDEV TPIPLDVMSA DEATALFLDA AATTGLDDTA AGIVAEVVET CGRLPLAVRI AAARLRSRPN WTLADLRDRL ARDGELLSKL EFGQRSVRVA FEMSYRELDE RHAHLFRLLG LAPGSDLSVG AVCALLGEAA AHEVETRLED LVDAHLLRSR RPGRYAFHDL LRAYAVNLTE STDPAELRAE AVERIVDYHV HTAHRATLRL DPLRRVTPLP PLHDAVKPAE FVDSRAARDW FDEELATLKA VLRLTETRRL DESTYQLVWA METYLSRRGL WADWERFQRQ AIAAAERLGD LPRQAYAHRV LGRPLTQLSR YPEACAEFSK AVALFKQLDD PVGQADSHRG LAWAYSRTGD RQRGLENVME SLVLYRKAGN QRRVAHALNG VGWQHALAGE YPQTLDYCGQ AIEIFAELTK QHGHHDAHAE AETWDSLGYA HHHLGNHTEA ITCYRTALDL LEDVEDRYFT TEVLTNLGDV YLSEDDRDAA RRVWSQALTT LDELGHPYAD KVRSRLATLD E
|
| |