Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_3782 |
Symbol | |
ID | 8884982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 4037007 |
End bp | 4039955 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003512532 |
Protein GI | 291301254 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.519842 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00738187 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTTACTC AAGGCGATGA ACCGCTGATC CGGTTGCTCG GTGACGTGTC GATCGTCGTC AACGGTGAGC CGGTCTCGGC GGGTACGCCG AAGCAGGCGT GTGTGTTGGC GTGTCTGGCC TGGACGCCGG GTACGCCGGT GGACACTGAC ACGATCATCG AGCGGGTCTG GGACGGTGAC GCCCCCGCCA ATCCGCGTAA TACCTTGTCG CCGTATGTGA CCCGGCTGCG TTCCCTCCTG TCCGACACGG GCGCGACCAT CACCGGTAAG AGCGGTACCT ACACCCTCAA CGTCGCCGAC ACCGACGTCG ACGTCCACGC CATGCGAACC CTCGCCGCCC GCGCCCGCGA CGTCCCCGCC CACCACCCCG ACGCGATCAG CCTGCGCCGC AAGGCATTGC GGCTGTGGCG CGGCTCGCCG CTGTCGCGCG TCGACTGCCG CTGGGCCGAC GCGATCTCGG TGACCCTCGA ACCCGAACAC GTCAGCCTGT GGACCGACCT GTTCGACGCG GAACTGGCCC AGGGCAACCA CGCCACCATC CTGGGAGAAC TGTCGCAGCT TCACAACCGC CATCCCGACA ACGAGAACCT CATCGGACAG TACATGGTGG CGCTGTACCG GTGCGGACGC GGCGTGGAGG CCCTGGAGAC GTTCCGGCGC ACCGATTCCC GGTTCCGCCG CGAGTTCGGC ATCGACGCGT CGCGGCGGCT GGCCGAACTA CAGCGCCGCA TCCTCGCCGA CGATCCGTCC CTGTCGGACA CCGCCGCCGC CAAGACCAGC ACCGAACCCG TCCCGGCGCA GCTGCCCGAG CCGCCGCCGG GCTTCGTGGG CCGCAGTGCC GAACTGCGCG CCGCCGACGA CCAGGTGGCG CGGGGACGGC GCGCGGTGGC GTTCGTCGGT CCCGGCGGCA TGGGCAAGAC CTCGCTGGCC CTGTGGTGGG CGCATCGCGT CGCCGCCGAT TTCGCCGACG GCCAGCTGTT CGCCGACCTG CGCGGCTACT CCGGCGAGGA ACCGGTACCG ACATCCCGGA TCCTGGCCGG ATTCCTGCGA GCCCTTGGCC ACAACGACTC CGACCTGCCC ACAGGGGAGT CGGAGCTGGC GGGCATGTAC CGCACGGCGC TGGCCAAGAA GAACGTCCTC ATCGTGCTGG ACAACGCCGC CGGCCCCGCG CAGGTCCGGC CACTGCTGCC CGGCGACGGC AACTGCCTGG CCGTCCTCAC CAGCCGCGAC GACCTGCGCG GCCTCAAAGT CGACCACGAC GTGGCCACCA TCGGCGTCGG CGAACTGTCC ACACCCGACG CGGTGGCGAT CCTGTCGGCC CACGTCACCG CCGCCCCCGA GGCCCGCGAC CAGCTGGAGC GACTGGCGGA ACTGTGCGGC CACCTGCCGC TGGCGATCCG GCTGGCCGCC AGTCGCCTGC CCTCCGGTTC GGCCGAGGAA CTGTCCGCAC TGGTCACCGA CCTGGAGTCC GGCGACCGGC TGGCGACCCT GTCGCGGCCC GGCGAGACCG TCGGCGGCCT GGCCGCCACC ATCGAGTCCT CCTACCGCCG ACTGGATCCC GCCGCCCGCG AGGTCTTCGA ACTGCTGGGC GCCCACCCCA GCGGCGAGGC CGACGCGGCG GCACTGGCCA ACGCCGTGGG CCGTCCGGTC GCGCAGGTGA ACCGCAGCCT GCGGCGCCTG GAGGCCGCCC ACCTCGCGCA CCGGGTCGAG TCCCGTTGGG GCGTCCACGA TCTGGTCAAG GAGTTCGCGG CCCGCGCGGC CGGTGACGTC GGTGACCGGC TGGGCCGCCT GTACGACTGG TACGCGATGG GCGTCCTGGG CGCCGACGGC CACATCTACG GTCCCCGGCC GCAACTGCGG GTCGACACGA CCGTCCCGCC GCCGGAGTTC GACGGCGCGG GCGAGGCCGA CGCCTGGGTG GAGCCGCGCG TCGACGTGTT CCTGGCCGCG ATCGACCAGG CCGCCGCGCG GCACCGCGAC CAGGCGCTGC GACTGGTGAC AGGACTGTGG CGGTACCTGT GGCGGCGCGG CATGTACGAC GCCTGGATCA CCGCCCAGCA CACCGTTCTG GACGCGTTCG GCGAGGACGC CGACGCGCGG GTGCGCTGCC AGCTGCTGAT GGGCCTCGGC AACGCCTACA ACTGCGCCGG ACGGCACGAC AAGGGCGCCG GGCACATCCG CGCCGCCTAC GACCTGGCCC AGGACCTGGA CGTGGAGGAA CGCGCCAAGA CCGCCGGGGC CCTGGGGGTG CTGTTGAGCG ACTGCGGTTC CAGCCGCGAG GCACGGTCCT ATCTGGAACA GGCCCGGGAC CTGTACGCCA AGGCCGCGGT GCCCGGCCGG GTGGCGGTGT CGCAGTACGA ACTGGGACGG CTGGACTACC GCGACGAGGT CTTCGCCGAC GCCCTGGACC GGTTCGCCGA GGCGGTGGCG GTGTGGGAGT CGCTGGCCCC GGGTTCGGTC GCGATCGGGC TGACCAGCCT GGGCCGCACC CAGGCGCGAC TGGGACGGCT GGCCGAGGCC GCGACCACTC TGGAGCGCGC GGTGGCGGCG GCCCGCGACC TGTCGCACCC GTCGGTGGAG AGCTACGCGC TGAGCATCCT GGCGATCGTG CTGCACCGCA ACGGCAGTAC CCGGGCGGGC TGGCGACGGC ACCGGGAGGC GATCGCGTTG ACGCCCCGGG TCACCCACGC CGACCTGCGC GTCGAGATCC ACAACTTCGC CGGGGTGTTC CGCGCCCAGG CCGAGCAGCC CGAGGCCGCG ATCGAGTACC ACCACGTCGC GCTGGCACTG GCCGAGGAGG CCGACGCCGG ATACGAGAAG GCCCGGGCCC ACCAGGGACT CGCCGACGCC TACGCCGCGC GGGACGACGC CGAGGCGGCG GCACACCGGC GGGCCGCGGC GGCCGGATTC GCGCTGGCAC AGACCCCGCC GCCGAGCGAC CTGCTGTGA
|
Protein sequence | MLTQGDEPLI RLLGDVSIVV NGEPVSAGTP KQACVLACLA WTPGTPVDTD TIIERVWDGD APANPRNTLS PYVTRLRSLL SDTGATITGK SGTYTLNVAD TDVDVHAMRT LAARARDVPA HHPDAISLRR KALRLWRGSP LSRVDCRWAD AISVTLEPEH VSLWTDLFDA ELAQGNHATI LGELSQLHNR HPDNENLIGQ YMVALYRCGR GVEALETFRR TDSRFRREFG IDASRRLAEL QRRILADDPS LSDTAAAKTS TEPVPAQLPE PPPGFVGRSA ELRAADDQVA RGRRAVAFVG PGGMGKTSLA LWWAHRVAAD FADGQLFADL RGYSGEEPVP TSRILAGFLR ALGHNDSDLP TGESELAGMY RTALAKKNVL IVLDNAAGPA QVRPLLPGDG NCLAVLTSRD DLRGLKVDHD VATIGVGELS TPDAVAILSA HVTAAPEARD QLERLAELCG HLPLAIRLAA SRLPSGSAEE LSALVTDLES GDRLATLSRP GETVGGLAAT IESSYRRLDP AAREVFELLG AHPSGEADAA ALANAVGRPV AQVNRSLRRL EAAHLAHRVE SRWGVHDLVK EFAARAAGDV GDRLGRLYDW YAMGVLGADG HIYGPRPQLR VDTTVPPPEF DGAGEADAWV EPRVDVFLAA IDQAAARHRD QALRLVTGLW RYLWRRGMYD AWITAQHTVL DAFGEDADAR VRCQLLMGLG NAYNCAGRHD KGAGHIRAAY DLAQDLDVEE RAKTAGALGV LLSDCGSSRE ARSYLEQARD LYAKAAVPGR VAVSQYELGR LDYRDEVFAD ALDRFAEAVA VWESLAPGSV AIGLTSLGRT QARLGRLAEA ATTLERAVAA ARDLSHPSVE SYALSILAIV LHRNGSTRAG WRRHREAIAL TPRVTHADLR VEIHNFAGVF RAQAEQPEAA IEYHHVALAL AEEADAGYEK ARAHQGLADA YAARDDAEAA AHRRAAAAGF ALAQTPPPSD LL
|
| |