Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_3399 |
Symbol | |
ID | 8884598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 3602178 |
End bp | 3604121 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003512156 |
Protein GI | 291300878 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00991696 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.321929 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACATCC TCATCAAGGT GCTGGGGCCG GTGCGGGTCA CCGACGCGGC GGGCGGTCCG GTCGCGGTGG GCAGTCAGCG GCGGCGTGAG CTGCTGGGTC GGCTGGTGGC CGCCGGGGGC CGCGCGGTGT CGCTGCGGGC GCTCGTCGAG GACCTGTGGG ACGACCCGCC GTCCACGGCC ACCGGCACGG TGCGCACCTT CGTGTCGGAG CTGCGGCGCG CGTTGGAGCC GCAGCGTTCG CCGCGGGGCC GCTCACGGCT CATCGAGACC GTCGGCACCG GGTACGCGCT GCGGGTGCCG CGCGAGCGCG TGGACGCGCA CCGGTTCGAG GACACGCTGC GCGCCGCCCG GGATCAGTCC GGTCAGGACG CGGCTGCCGC GCTGACCGAC GCGATCGCTT GGTGGGACGG CGAACCGTAC GCCGATCTCG ACGCCTCGGC GTGGGTGGTC CGCGAGCGCG CCCGGCTGGG CGAGCTTCAC CTCCAGGCGG TCGAGTTGCG CGCGCGGGCG GTGATCGACC TGGGGCGCGG CGAGTCGCTG GTGGCGGAGC TGGAGGCGTT CGCGGCCTCG CATCCGTGGC GCGAGCACGC CTGGGTGTTG CTGTCCCACG CCCTGTACCA GGCCGACCGG CAGGTCGACG CACTGTCCAC ACTGCGCACG GCTCGGGAGC GGCTGCTGGA CAGGTTCGGG TTGGAGTCGG CGGCCGGTCT CGACGATCTC GAACGTGACA TCCTGCGTCA CGCCGCGCAC CTGGCGCCCG CCGCCCGCGA CGGCAACCGG CTGGGGCTGC TCACCCGCAC CGAGGCGAGC GGGACCTACT CCCGGCTGCG CGCCATGAGC ACTGTGGCCA GTGCGGCGGC GATCACCGGC GGCACCAACC TGGTGCTGGC CCAGCGGCAA CGCGCGGCGG CCGTCGCCGA GGCCGAGCGC ACCGACGACC CCGATCTCAC CGCTCGTGTC ATCGCCGCCT ACGACGTCCC GGCCGTGTGG TCGCGCGCCG ACGATCCCGA ACAGTCGCGG GCGCTGGTTG CGACGACCCG GCGCACTCTG CTGCGGCTCG GGTCGGACGC CCCCGCGGCG CTTCGGGCGC GGCTGCTGGC CACCGTCGCG CTGGAACACC GGGGGTCGCG CGACGCGTGG GCCGCCGAGG CCGCGCGCGA AGCCGAGCGC ATCGCCCGCG ACCTGTCCGA CCCGAACGTG CTGGTGCTGG CGCTCAACGG CCGGTTCATG CAGTCCTTCC AGCGTCCGGG AAACACCGCC GAACGCGACG CCATCGCCAG TGAACTCATC GCGGTGTCCA CTCGGCACGA TCTGGCGACG TTCGAGATCC TCGGTCACCT CATCAAGATC CAGGTCGGCG CGGCGCTGGG CGACATCGAT ACCGCCGAGC GCCATGTCGC GGCGGCCGAG CGGCTGGCCG ACGTCCACGA GACGCCGCTG GTTCGCGTCC TGACCGCCGC CTTCGCCGCG ATGCGACTGG CGCAGGCGTC CGGCGATCCC GCCGAGGCCG CGCGGGCGTA CCGGGCTGTG GCTACGGAAC TGGCGGGTGC CGGTATGCCC GGGGTGGAGG CCGGGATGTT CCCGCTCGCG CTGTTGTCGT TGCGGTTGCG GCACGGGCGG CCCGCGCCGG TGGATCCGGA TCTCGACAGG GGTCCCTATC GTGCCTGGGT GCAGCCGCTC ATCGATCTCG CGCGGGACGA CCCGTCGGCG GCGCGCCGGT CGGCTGCCGC GTTGCCCGAA CCGGCCGCCG ACCACCTGTA CGACGCGCTG TGGGCGGTCA CCGCACATAC CGCGATCCGC TTGGAGGACG CGTCGCTGGC GGCTCGGGCC CGCGAGGCGC TGGACCCGTT GCGGGGCCAG ATCGCGGGCG GGACAACGGC TATCGTCAGT TTCGGACCGG TGGATGACAT CCTGGCCGAG CTCGACGCGC ATCGCGGGTC ATGA
|
Protein sequence | MDILIKVLGP VRVTDAAGGP VAVGSQRRRE LLGRLVAAGG RAVSLRALVE DLWDDPPSTA TGTVRTFVSE LRRALEPQRS PRGRSRLIET VGTGYALRVP RERVDAHRFE DTLRAARDQS GQDAAAALTD AIAWWDGEPY ADLDASAWVV RERARLGELH LQAVELRARA VIDLGRGESL VAELEAFAAS HPWREHAWVL LSHALYQADR QVDALSTLRT ARERLLDRFG LESAAGLDDL ERDILRHAAH LAPAARDGNR LGLLTRTEAS GTYSRLRAMS TVASAAAITG GTNLVLAQRQ RAAAVAEAER TDDPDLTARV IAAYDVPAVW SRADDPEQSR ALVATTRRTL LRLGSDAPAA LRARLLATVA LEHRGSRDAW AAEAAREAER IARDLSDPNV LVLALNGRFM QSFQRPGNTA ERDAIASELI AVSTRHDLAT FEILGHLIKI QVGAALGDID TAERHVAAAE RLADVHETPL VRVLTAAFAA MRLAQASGDP AEAARAYRAV ATELAGAGMP GVEAGMFPLA LLSLRLRHGR PAPVDPDLDR GPYRAWVQPL IDLARDDPSA ARRSAAALPE PAADHLYDAL WAVTAHTAIR LEDASLAARA REALDPLRGQ IAGGTTAIVS FGPVDDILAE LDAHRGS
|
| |