Gene Information |
Locus tag | Snas_4966 |
Symbol | |
ID | 8886173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 5275500 |
End bp | 5278592 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003513697 |
Protein GI | 291302419 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
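The coordinates and lengths listed above can be cross-checked with a short calculation. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes the stop codon. The record does not state either convention, so both are assumptions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 5275500
end_bp = 5278592
gene_length_bp = 3093
protein_length_aa = 1030

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which is not translated into an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)
```

Under these assumptions the span equals the listed gene length, the length is a whole number of codons, and the codon count minus the stop codon equals the listed protein length.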
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.998552 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTGG AAATACGCCT GCTGGGAGCG ATCGAACTGT GGGCCGACGG TCGTCGCGTC GACATCGGCC CGGCCAAGCA GCGTGCCGTC TTCGCGATCC TGGCAGCTGA GGCGACCAGC GTGGTGCCGA CCGACCGCCT CGAACACCAC ACCTGGGGCG ACGCCCCACC CAAGGACGCC CGCCGGACCC TGCACGTATA CCTGACCCGG CTGCGCCGCG CGCTGTCAGG CATCGAGGGC CTGTCGCTGG AGCGGCACGG CGGCGGCTAC GTCCTCGGCG TGGACACCGA ACAGGTGGAC CTGCACCGCT ACCGGCGACT GTGCGGCGCC GCCCGCGACG CCGCCGACGC CTCCCACGCC GCCTCACTGT GGCACGAGGC GTTCGCACTG TGGCGGGGCG AGCCGTTCAC CGACCACGAC ATCCCCCGGC TGAACCGGCT GCGCCACGAG CTGCGCGCCG AACGCGAGAC CGCCGAACTG GACCGTAACG ACGCCTACCT GCGCGCCGGA CGGCACACCG AACTGCTTGC CGACCTGACC GAACAGGTCG AGCGACGTCC GTTGGACGAG CGACTGGCGG CCCAGTTCAT CGACGCGACA CACCAGTCGG GTCGCACCGC CGAGGCCCTC ACCCACTACC GCGACCTCCG CGACCGGCTG GTCGGCGAAC TGGGCCGCGA ACCCGGATCG AGCCTGCGCG ACCTGCACCG CCGCATCCTC AACGACGACC AGGCCCCCAC CGTCACCGAG CAGTCGGTGC CGCGACAACT GCCGGTCACG ACCGTGACGT TCACCGGCAG AGACGCGGAA TCGGCCCACG CGATCGCGCT GCTGGGTTCG GGAACCCCGA TCGTGTCGGT CGACGGCATG GCCGGGGTCG GCAAGACCGC CTTCGCGGTC CGGGTGGCCA CAGAAGTGTC CGACAAGTTC TGCGACGGCC AGCTCTTCGT GGACCTACGC GGCTTCTCCG ACGACCTGGC ACCGCTACCC GCGAACGAGG CGATCGGCGG CATGCTGCGC GACCTCGGCG TGCCGCAGAC CCAGATCCCC GCCGACCTGG CGGGACGCTC GGCGATGCTG CGCAGCCGAC TGGCCGACCG ACGAGTCCTG CTGGTCCTCG ACAACACCAT CGGCACCGAA CAGGTCCTAC CCCTGCTGCC CGGCCCCGGC GACAGCGCCG TACTGATCAC GAGCCGCCGC AAACTGCCCG ACCTGCCCGA CGCCGAACCG ATCACACTGG ACGTCCTACC CCGCCACGAA GCCCGCGAAC TGTTCACGAC GGTCGCGCAA CGAAACATCG ACGCCGAAAC CGACCCCGTG AACGACATCG TCACGCTGGC CGGTCAACTC CCGCTGGCCC TGCGACTGGC GGCGGCCCGA CTGCGCAGCC GTCCGGCATG GACGGTCACC GACCTTCGCG ACCGGATGGC CTCCGAACGA CAAGGCGAAC GCCGCTCACC GGCCGGACGA AAACTCGGAG CCGCCTTCGA ACTGTCCCTA CGCGCCCTCA CCGTCGAAAA ACGCGAGACA TTCCTGTCGG CGAGCCTGAT CCCGGTCCAC GATCTCACCG CGGCATCGGT CGCCGCCGTA ACCCAACACC CCATCGACGA GGTCGAGGAA ACCCTCGAAG AACTGTGCGA CCTCAACCTG CTCACCACCC CGACGGCGGG CCGCTACCAG TACTTCGACC TGCTGCGCGA CTACGCCGCC CAGATAGCCG AAACGAACCA GCCCGCCCAC ACCCGCCACG ACATCACCGG GCGAGCGTTG CGCTGGTACA TGGCGAATGC CCGCACCGCA 
TGCGCGGCGG TACGTCTCCC GTTCCCCGAG AACCCGGGCC TGCCCACCGA CCCTGCGGAC ATGAGGTTCG ACGACGAGCA GTCCGCGTTG GCCTGGCTGG ACTCCGAACG CGGGAACCTG CTGGCGCTGT TGCGCCACTC AAGTGCCACA TCGATACCGA TGTGGACAAT GGTCGACGCG ATCTCGAACT ACCTGCTGTA CCGCGCCGAA GGCGCCAGCC TGCTCGAGAT CTGCGACCTC GCACTGAGCG AACCCGAGGC CCTGGCCGAC AACCTCGCCC AGGTCAAACT GTTGAGCCGC AAAGCGAGCG CGGCACAGAG CCTGGGCGAC CGGACGGCCA TGCTGACCTA CACCCAAGCC GCCCGCGAAC GCCTCACCCC CGACGCCGGA CCGAAACTGC GACTGGGCGC CCTGTCGCAA CTGATCATGG TCCACCGAAC CCTCGGCAAC GTCGCGGAAG GAGCCACGGC CGCGGCGGAG GCGCTCAAGA CCTACCGCGA ACTCGACGAA GACGGTGCCT CGTACATCCT GCAACAAGTC GCGGCGGCGG TCTCCGACAC CGGCGACCTG CACGCGACCC GCGAACTGCT CGAAGAGGCC AGTCGTGAGT TCCGGCGACG CGACAGCCTC AACCTGGGTT TCGCGCTCAC CGCGCTGGTG GAGGTCTGCA CGGAACTGGG CGACTTCGAA GCGGCCGAAC ACTTCGCGGA CGAGACCCGC AAGTGGATGG GCAAGGCCGG AACCGAAACG GCCCAGCCGC AACTGCACCA AGACATGGCG CTGCTCCACC ACGCCCGGGG CGAAACCGAA CCAGCCCTAG AACACGCCCG CCGCGCGGTC GACCTGGCCC GCCAAATGGG CATGCACGGC GTCGACAGCG CAGTCCTGTC CACACTGGCC CGGGTCTCAC GAGACGTCTC ACCGGACGCA CACCAATACG CCGAGGAAGC AGTCAAAACC AGCCGAGACC GCGAAGCGAT CGCCGAACAC ATCGTCGCCC TCTGGGTACT GGCCGACATC CAACTCGCCG CGGGCGACAC CACCTCGGCC ACCCACACGG CGACCACAGC CCTGGACCTG GCCCGCAAAC ACGGATACCG CCTCCTGAAG GCCAAGATCC TGACGGTACT GACCGAAGTC CACCTGGCCA CAGGCGACCA CGACCAAGCC CGCCACACCG GAACCGAGGC CCTACGCGAC CACCAGTGGT GCGGATCCCG CCCCCAACAT GCCAAAGTCC ACAAGCTACT AGCGCAGGCG ACCACCGATG ACGGGTCAGG TCGGATCAGT TAG
|
Protein sequence | MTLEIRLLGA IELWADGRRV DIGPAKQRAV FAILAAEATS VVPTDRLEHH TWGDAPPKDA RRTLHVYLTR LRRALSGIEG LSLERHGGGY VLGVDTEQVD LHRYRRLCGA ARDAADASHA ASLWHEAFAL WRGEPFTDHD IPRLNRLRHE LRAERETAEL DRNDAYLRAG RHTELLADLT EQVERRPLDE RLAAQFIDAT HQSGRTAEAL THYRDLRDRL VGELGREPGS SLRDLHRRIL NDDQAPTVTE QSVPRQLPVT TVTFTGRDAE SAHAIALLGS GTPIVSVDGM AGVGKTAFAV RVATEVSDKF CDGQLFVDLR GFSDDLAPLP ANEAIGGMLR DLGVPQTQIP ADLAGRSAML RSRLADRRVL LVLDNTIGTE QVLPLLPGPG DSAVLITSRR KLPDLPDAEP ITLDVLPRHE ARELFTTVAQ RNIDAETDPV NDIVTLAGQL PLALRLAAAR LRSRPAWTVT DLRDRMASER QGERRSPAGR KLGAAFELSL RALTVEKRET FLSASLIPVH DLTAASVAAV TQHPIDEVEE TLEELCDLNL LTTPTAGRYQ YFDLLRDYAA QIAETNQPAH TRHDITGRAL RWYMANARTA CAAVRLPFPE NPGLPTDPAD MRFDDEQSAL AWLDSERGNL LALLRHSSAT SIPMWTMVDA ISNYLLYRAE GASLLEICDL ALSEPEALAD NLAQVKLLSR KASAAQSLGD RTAMLTYTQA ARERLTPDAG PKLRLGALSQ LIMVHRTLGN VAEGATAAAE ALKTYRELDE DGASYILQQV AAAVSDTGDL HATRELLEEA SREFRRRDSL NLGFALTALV EVCTELGDFE AAEHFADETR KWMGKAGTET AQPQLHQDMA LLHHARGETE PALEHARRAV DLARQMGMHG VDSAVLSTLA RVSRDVSPDA HQYAEEAVKT SRDREAIAEH IVALWVLADI QLAAGDTTSA THTATTALDL ARKHGYRLLK AKILTVLTEV HLATGDHDQA RHTGTEALRD HQWCGSRPQH AKVHKLLAQA TTDDGSGRIS
|
| |
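The sequences above are displayed in space-separated groups of ten residues, so they need cleaning before any computation. The following is a minimal sketch, not part of the record, of how one might strip the grouping, compute a GC fraction, and translate the coding sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose amino-acid assignments translation table 11 shares (table 11 differs in its permitted start codons). The sketch also assumes the gene sequence is shown in coding orientation, which its leading ATG and trailing TAG suggest even though the gene lies on the minus strand.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Drop the spaces and line breaks used to group residues for display."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above, as displayed.
prefix = clean_sequence("ATGACCCTGG AAATACGCCT GCTGGGAGCG")
print(translate(prefix))    # MTLEIRLLGA
print(gc_fraction(prefix))  # GC fraction of this 30-base prefix only
```

The translated prefix matches the first ten residues of the protein sequence above. Applying the same functions to the full gene sequence would allow the listed GC content and protein length to be checked in the same way.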