Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_3863 |
Symbol | |
ID | 8885063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 4124373 |
End bp | 4127318 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003512611 |
Protein GI | 291301333 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.291313 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTTCT CGGTCTTGGG ACCGGTGTCG GTGTCCACGT CGGAGGGACC GGTGACCGTC GAGGCACCGA AACAGCGGCT GCTGCTGGCG CACCTCATCA GCCGGACCAA CCACCCGCTG CCGCCCGACA GCCTCATCGA GCTGCTGTGG AGCCACAATC CGCCGTCCTC GGCGCGCAAG GCGCTGGCCT GGCACGTCAT GCAGCTGCGC AACGTCCTGG GCGGCAAGGA ACGACTGGCC TGGCACGGCA ACGGATACGT CCTCAACACC GAACGTGACG AAGTGGACGC CGCCCGCTTC GAGGAACTGC ACCGGCAGGC CACCGCCGTG CGCGACGCCG ATCCCCGGCG GGCCGCGCAA CTGCTGAACC AGGCGCTGGG CCTGTGGCGC GGCACCGCGT ACGGGGAACT GCCCGAGTCG GGTGCGCTGC TGGAGGAGGC CAACCGGCTC AACGAACTGC GGCTGGTGGC GCTGGAGGCG CGGTGCGACA TCGACCTGGA GCTGGGACGG CACGGCGACA TCGTGCCCGA ACTGACCGGT CTGGTGGCCG ATCACCCGTT CCGGGAGAAG TTCCGGGCCG CGTTGATGCT GGCGCTGTAC CGCTGCGGCC GCCAGGCCGA CGCCCTGCGC AGCTACCGGG AGGGCCGGAC GCGGTTCGCC GAGGAGCTCG GCCTGGAACC GGGACCGAGT CTGCGACAAC TGGAACAGCG CATCCTCAAC GCGGACCCCG GTCTCGACGC GCCGGTCGCC GAGACCGCGA CGATCGCGTC GGTGGTCCCG GCGCAGCTGC CCGCCGACCT GCGGTCGTTC ACGGGACGGG AGCCGGAGGT GGCGCGGTTG CTGGACCTGT CCACCGTGGA CACCAATCGG CCCGGGGCCA TCGTGGTCGG TGCCCTGGAC GGGATGGCGG GGATCGGCAA GACGGCGCTG GCGGTGCATG TCGCGCAGCG GCTGACGGCC AGCTACCCCG GTGGGCAGCT GTTCATCGAC CTGCACGGTT TCACCGAGGG CGTCACGCCG GTGACGCCGG GGCAGGCCCT GGATCGCATG CTGCGGACGC TGGGGGTGGC GTTGCAGCAG ATCCCGCCCG ACGTCGACGA ACGCGCGGCG CTGTACCGCA GCCTGCTGGC GGACCGGCGG ATGCTCATCG TCCTGGACAA TGCGGTCAAC GAGGCCCAGG TGACGCCGCT GCTGCCGGGG GCTTCGGGGA GCCTGGTGCT GATCACCAGT CGTCGGCGGC TGGTGGGGCT GGAGGGGGCG CAGTATCTGC AACTGGACGT GCTGTCGCCC GACGAGGCCG TGTCGCTGCT GCTGCGACTG GCCGAGATCT CGCAGCCGTC CGACGCGGAT CGGGAGCTGG CGGCCGAGAT CGTGACGCTG TGCGGGCGGC TGCCGCTGGC GGTCCGGATA GCGGCGGCGA AGCTGCGGCA CCGTCGGCAC TGGTCACTGC GGACCGTCCG GGACCGGCTG CTGGATGAAC GCGACCGGCT GCACCAGCTG GAACTGGGTG AGCGCAGCGT GTCGGCGGCG TTCACGATGT CTTATGAGGA CATCGATGCC GAGGCGCGGC GAGTCTTCAG GTTGTTGAGC CTGTTCCCCG GTTCCCATTT CGACGTACTG GTCGCGGCGG CGCTGGCCGA TCGCTCCGTC GCGGTGGTCG AGGAGCTGCT GGACGTCCTC ATCGAGGCCA ATCTGCTGAC GGTGCTGGGG CCAGGCCGGT TCGCGTTCCA TGACCTGTTG CGGCGGTTCG CGAACCAGGC CCACGAGGCG GCGGCCGACT ACGCGACTGA GACGGCCGAG CTGCACAGCA GGTTGCTCAA CTACTACCGC CACGCCGTCT ACTCGGTGGC GACCACCATC GACCCCGGCA TGGTGAACCT CGCCGAACCC CCACAGACCG CCTTGGCGCT TCCCGAACTG CCCACCACCG AATCGGCTCA CGACTGGTAC CAGGCCGAGC ACCTCAACGT CTTCGCCGTC ATCGACCTGG CACCCGACTG GGGGCTGGAC GAACAGCTGT GTCAGCTGGT CAACGCGGTG AGCACCGTGT CGATCATCTT CTCGCACTAC CAATGGCAGT ACGACATGTG CGAGCGCGGC CTCCAGGCGG CGCGGCGCTG CGGCGACCGG GACTGCGAGG CCCGGCTGTT GAACCACCAG GCGCTGGCGC TGCGAAAACT CGACCGCGTC CCCGAGGCCG TCGCACTGCA CGAACAGTCG CTGCTGCTGC GGCGGGAACT GGGCGACAGA CTTGGCGAGG CCGCGATACT CAACAATCTC GGTCTCATCC ACCGGCGAGC CGGTGCCCTC GACAAGGCGA TCGCGACCTA CGAACAGGCG CTGCGGCTGG GTGACGACGC GCGCATGATG TCCATACACG CGCTGCTGCG CAGCAATCTG GCCGAGTGCT GGATCCGGCT GGAGCGTTTC GACGCGGCGC TGGAGCAGGT GCGGCTGGCC GAACCGATCA TCACCGAGCT CGGCAGCGAA CGCCAGACCG CCCGGCTCCA GCACTACTAC GGGTCGGTGT ACTACCACCT CGGCGAGTAC GACCGGGCGC TGCGGCACTT CGCCCGGTCT CTCGAATACT GCGAGCGGGT CATGGAACCG TACGGGCATG CCAGTGTCCT CAACGGCATG GCCAATGTGT ACCGGGATCG GGGCGACATT CCCACCGCGG TCGACTTCCA TGAGCGTGCC CTGCTGCTGT GCCGTTCGAT CAGCGACACC GATCTGGAGG CGCTGATCCT GTACGGCCTG GGTCGCACCC ACCGCGCGGC CGGTCACCGC GAGCTGGCGT TGAGCAACCT GCGCGCGGCC GTCGCGGCGG CCACCCAGAC CGGGGACGCC TACCAGCTGG AACACGCGAA CCGGGAGCTG GCCGACACGC AGGCGGCGGA CCATCCGCAG CGGCTGGAAT CCAGCGACAC CCCGGCCGTC AACTGA
|
Protein sequence | MQFSVLGPVS VSTSEGPVTV EAPKQRLLLA HLISRTNHPL PPDSLIELLW SHNPPSSARK ALAWHVMQLR NVLGGKERLA WHGNGYVLNT ERDEVDAARF EELHRQATAV RDADPRRAAQ LLNQALGLWR GTAYGELPES GALLEEANRL NELRLVALEA RCDIDLELGR HGDIVPELTG LVADHPFREK FRAALMLALY RCGRQADALR SYREGRTRFA EELGLEPGPS LRQLEQRILN ADPGLDAPVA ETATIASVVP AQLPADLRSF TGREPEVARL LDLSTVDTNR PGAIVVGALD GMAGIGKTAL AVHVAQRLTA SYPGGQLFID LHGFTEGVTP VTPGQALDRM LRTLGVALQQ IPPDVDERAA LYRSLLADRR MLIVLDNAVN EAQVTPLLPG ASGSLVLITS RRRLVGLEGA QYLQLDVLSP DEAVSLLLRL AEISQPSDAD RELAAEIVTL CGRLPLAVRI AAAKLRHRRH WSLRTVRDRL LDERDRLHQL ELGERSVSAA FTMSYEDIDA EARRVFRLLS LFPGSHFDVL VAAALADRSV AVVEELLDVL IEANLLTVLG PGRFAFHDLL RRFANQAHEA AADYATETAE LHSRLLNYYR HAVYSVATTI DPGMVNLAEP PQTALALPEL PTTESAHDWY QAEHLNVFAV IDLAPDWGLD EQLCQLVNAV STVSIIFSHY QWQYDMCERG LQAARRCGDR DCEARLLNHQ ALALRKLDRV PEAVALHEQS LLLRRELGDR LGEAAILNNL GLIHRRAGAL DKAIATYEQA LRLGDDARMM SIHALLRSNL AECWIRLERF DAALEQVRLA EPIITELGSE RQTARLQHYY GSVYYHLGEY DRALRHFARS LEYCERVMEP YGHASVLNGM ANVYRDRGDI PTAVDFHERA LLLCRSISDT DLEALILYGL GRTHRAAGHR ELALSNLRAA VAAATQTGDA YQLEHANREL ADTQAADHPQ RLESSDTPAV N
|
| |