Gene Information |
Locus tag | Snas_0196 |
Symbol | |
ID | 8881374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 208265 |
End bp | 211168 |
Gene Length | 2904 bp |
Protein Length | 967 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003509008 |
Protein GI | 291297730 |
COG category | |
COG ID | |
TIGRFAM ID | |
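The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    return gene_len_bp // 3 - 1


# Values taken from the record for Snas_0196.
length = gene_length_bp(208265, 211168)
print(length, protein_length_aa(length))  # 2904 967
```

The strand does not affect this arithmetic; it only determines which end of the interval holds the start codon.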
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.46607 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0345978 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTCC CCCTCGGGCC GTCCAGCGCA TTCGAATCCG CACCGAAAAT CCCCATGGAC AGTCACATCA CACTCCCCAG CGAACTCCGG CCGGTGCTGG TCAGGCTGCT CGGTCCGGTC GAGATCGCCG GTGAGGACGG CTGGCAGCGG GGCGGGGCGC CGAAACAGGC GTGCGTGCTG GCCTGTCTGG CGCTGTCCCC CGGCACCGCG GTGAGCATCG AGACACTGGC GCACCGCGTC TGGGACGGCC CGCATCCCAC CGAGACCCGC AACATCATCT ACGGCCACGT CACCCGGCTG CGGCAGTTGC TTAAACCGCA CGACGAGGTC CGGCTGCGGC GCATGGGCAC CTCGGAGTAC CAATTGGACA TCGAGCCGGA ACTGGTGGAC GCGTGGCGCA TGCGCGACCT GGCGGGCAAG GCGCGGGCGG TGCAGGCCGG TGGCGACATG ACGACGGCGG CGGAGTTGTG GCGCGGCGCG GTCGAGCAGT GGCGCGGCCC GGCGCTGGCG GGCATCAAGA CCGAGTGGAG CGCCCGCACC AGCCGTCGGC TGCGCAACGA CTACCTCGCG GCGGCGGCGG GTTGGAGCGA GTGCCTTCTC CAGTTGGGGC AGCACGAGGC CGTCGTCGGC AACCTCGAAC CCATTGTGGA GCATCATCCG TTGGTGGAGA ACCTGGTGGC GCCGTTGCTG TTGGCGCTGT ACCGCTGCGG TCGGACGATG GACGCGCTCA ACCGTTACAC CGACACCCGC CGCCAGCTGC GCAAGACGCT GGGCAACGAC CCCGGGGAGC GGCTGCGGAC GCTGTACCAG CGGATCCTGC GCCAGGAACC GGAACTGTTG CGGGCGCCCG CGACCGAGCG GCGCGAGGCC GCGCCCGCCG CGGTGTCACC GGCACCGGCG CCGGTGCCCG CACAGCTGCC CGCCCGGGTG TCGGGGTTCA TCAACCGGGA GGCGCAGCTG GAGACGCTGG ACGCGCAGCT GCCGTCCGCG TCGCTGGTGA CGATCTCCGG CATGGCCGGG GTCGGCAAGA CGGCGCTGGC GGTGCACTGG GCGCAGCGGA TCGCGTCCCG GTTCCCCGAC GGGCAGCTGT ACGTGAACCT GCGTGGCTTC GATCCGTCCG GGCAACCGAC CGAACCGGCC GACGTGATCC GGGGGTTCCT GGACGCGCTG GCCGTTCCGC CGCACAGCAT CCCGGTGTCG CCGGACGCGC AGATCGGGCT GTACCGCAGC CTGGTGGCGG ACCGCAAGAT GTTGATCCTG CTCGACAACG CGGGCGAGGA GAAGCAGGTG CGGGACCTGC TGCCGGGGAC GCCGGAGTGC CTGACGATCG TCACCAGTCG CAACCGGCTC ACCGGGTTGA CGGCCAGCCA CGGGGCGGTG CCGATGCCGC TGTCGGAGTT CACCCCGGAG GAGTCGCGCC GGTTTCTGCG GAGCCGGTTG GGCGAGGGGC AGCTGGCGGC CGAACCGCAG GCCGCCGACA CGATCATCGC CACCTGCGCC GGACTGCCGC TGGCGCTGGC CGTGGTGGCG GCGCGGGCGG CCACGATGCC GCAGGTGCAG CTGGAGCAGT CGGCGGCGGA GTTGCGCGGC GCCACCGGGG ATCTGGAACC GTTCGTGATG AGCGACGTGT CGACCGACAT CCGGGCGGTG TTCTCGTGGT CCTACCGGCT GCTGGGCGCC GAGGCCGCGC ACTTCTTCAT CCTGCTGGGC CACCATCCCG GTCCCGACAT CTCCACGGCC GCCGCCGCGT CGCTGGCGGG GGTGCCGCTG CCGAGGGCGC GGGCGCTGCT GGGTGAACTG 
CTGCGGGTGC ACCTGGTGAC CGAGCGCCGT CCGGGCCGGT TCGTGTGCCA CGACCTGCTG CGCTCCTACG CGTCCGAACT GGACACACTG GCGGAGTGGG AGGGGCTGCG CGAGGGTCTG CGGCGCGTCG TCGACCACTA CACCCACACC GCCCACGCGG CGGCGCGGCT GGTGCATCCG ATGCACGACC TGCCGCCCGA GCCATCGCGC TGTCCCGGCG TCACCCCGGA GAAACCCGCC GACCGGCAGG CCGCGATGGC CTGGCTGCGC ACCGAACACG AGGCGCTGTT GGCCAGTATG GAGCTGTCGA TCGCCGACGG CTGGGACTCA CGGACGCTGC GGATCGCGGC CGCGATGCTG ATCTGCCTTG ACCTGCACGG CATGTGGGGC GTCATGGTCG CCACCCAGGA GAAGGCGCTG GCGGCGGCGC AGCGGCTCGG CGACCCGCGG GCGATCGCCG GGGTGCACCG CGACCTCAGC CGCGCCTACT CCCGGCTGGG GCGCTTCGAC GAGGCCGAAC AGCGAATAAC CGAGGCGCTG CGGCAGTTCG AGCAGCTGGG CGACCTGGGC GGGCAGGCGC GGGTCCAGCA CCAGGCGTCG GTGCTCAACG CGATGCGGGG ACGGCACCGG GAGGCCCTCA CGGCCGCCAC CCGCGCGGTC GAGCTGGGTG AACTCGCCGG TGACCTGGTC GCGCAGGCCA TCGGGCACAA CGCCATCGGC TGGCATCTGT CGCAACTGGG CGATCAGTCG GAGGCACTGT CGCACTGCCA GCGGGCGATG GTGCTGGCCG ACAAGATCGG CTACCAGCAG ATCAAGGGCG GCATCTGGGA CAGCCTGGGC CATATCCACC GCCGTCGGGG TTGCCTGGAC CAGGCGATGC ACTGCTTCCG GCGGGCCGTG GACGACGAGC TGGCGATCGG CAACCGGCTG GGGTTCGCCT CCGCGCTGGT CGGGTGCGGG GACGTCCGGT TCCTGCAGGG CGAGACCGAC GCGGCCCGCG AGACCTGGCA GCGGGCGCTG TCGATCCTGG AGGCACTGAA GCACCCCGAC GCCGAGCGGG TGCGGGGCCG GCTGGCCGGG CGGCTCGACC CGTCCGGGGA CTGA
Protein sequence | MSLPLGPSSA FESAPKIPMD SHITLPSELR PVLVRLLGPV EIAGEDGWQR GGAPKQACVL ACLALSPGTA VSIETLAHRV WDGPHPTETR NIIYGHVTRL RQLLKPHDEV RLRRMGTSEY QLDIEPELVD AWRMRDLAGK ARAVQAGGDM TTAAELWRGA VEQWRGPALA GIKTEWSART SRRLRNDYLA AAAGWSECLL QLGQHEAVVG NLEPIVEHHP LVENLVAPLL LALYRCGRTM DALNRYTDTR RQLRKTLGND PGERLRTLYQ RILRQEPELL RAPATERREA APAAVSPAPA PVPAQLPARV SGFINREAQL ETLDAQLPSA SLVTISGMAG VGKTALAVHW AQRIASRFPD GQLYVNLRGF DPSGQPTEPA DVIRGFLDAL AVPPHSIPVS PDAQIGLYRS LVADRKMLIL LDNAGEEKQV RDLLPGTPEC LTIVTSRNRL TGLTASHGAV PMPLSEFTPE ESRRFLRSRL GEGQLAAEPQ AADTIIATCA GLPLALAVVA ARAATMPQVQ LEQSAAELRG ATGDLEPFVM SDVSTDIRAV FSWSYRLLGA EAAHFFILLG HHPGPDISTA AAASLAGVPL PRARALLGEL LRVHLVTERR PGRFVCHDLL RSYASELDTL AEWEGLREGL RRVVDHYTHT AHAAARLVHP MHDLPPEPSR CPGVTPEKPA DRQAAMAWLR TEHEALLASM ELSIADGWDS RTLRIAAAML ICLDLHGMWG VMVATQEKAL AAAQRLGDPR AIAGVHRDLS RAYSRLGRFD EAEQRITEAL RQFEQLGDLG GQARVQHQAS VLNAMRGRHR EALTAATRAV ELGELAGDLV AQAIGHNAIG WHLSQLGDQS EALSHCQRAM VLADKIGYQQ IKGGIWDSLG HIHRRRGCLD QAMHCFRRAV DDELAIGNRL GFASALVGCG DVRFLQGETD AARETWQRAL SILEALKHPD AERVRGRLAG RLDPSGD
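The protein sequence corresponds to a translation of the gene sequence with translation table 11. A minimal sketch of that translation follows, under these assumptions: the amino acid assignments for table 11 match the standard code, the first codon is emitted as M (the initiator), and translation stops at the first stop codon. The helper name `translate_cds` is hypothetical, and spaces or line breaks inside the sequence are ignored.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
# Table 11 uses the same assignments as the standard code; it differs
# only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate a coding sequence up to, but not including, the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        # Assumption: the initiator codon is always read as methionine.
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("ATGAGCCTCC CCCTCGGGCC GTCCAGCGCA"))  # MSLPLGPSSA
```

Passing the full gene sequence to the same function should return a string that can be compared against the protein sequence once its spaces are removed.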