Gene Information |
Locus tag | Snas_3801 |
Symbol | |
ID | 8885001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 4052742 |
End bp | 4055723 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003512550 |
Protein GI | 291301272 |
COG category | |
COG ID | |
TIGRFAM ID | |
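
The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 4052742
END_BP = 4055723


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp: int, stop_codons: int = 1) -> int:
    """Residue count for a CDS, assuming it ends with `stop_codons` stop codons."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - stop_codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2982 993
```

Under these assumptions the result agrees with the Gene Length (2982 bp) and Protein Length (993 aa) fields.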
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000525265 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTCCCG TTGGTGACTC ACCGCTCATT AGGCTGCTCG GAGAGGTATC GGTTCTCGTC CAGGGAAAAC CGGTCTCGGC GGGTACGCCG AAGCAGGCGT GTGTGTTGGC GTGTCTGGCC TGGACGCCGG GTACTCCGGT GGACACTGAC ACGATCATCG AGCGGGTCTG GGACGGTGAC GCTCCGACCA ATCCGCGGAA CACGTTGTCG CCGTATGTGA CCCGGTTGCG TTCGCTGCTG TCGGACACCG GTGCGACCAT CACCGGTAAG AGCGGCACCT ACACCCTCAA CATCGCCGAC ACCGACGTCG ACGTCCACGC CATGCGCGCC TGGGCCACGC AGGCCCGGGG ACTGGCGGCC ACCGACCCGG CCCGCGCGGT CGCGCTGCTG CGGGCGGCGC TGGGCCTGTG GCGGGGACGA CCGCTGTCGC GCGTCGACTG CCGCTGGGCG GACTCGATCT CCGTGGCCGT GGAACCCGAG CACGTCGAGG TGTGGACGCA GCTGTTCGAG GTGGAACTGT CCCGCGACAA CCACGCCCGC GTCATCGGCG AACTGTCCGA AGTGGTGGCC GACAATCCCG TCAACGAGAA CCTCATCGGG CAGTACCTGG TGGCGCTGTA CCGCTGCGGC CGTGGCATCG AGGCGCTGGA GTGCTACCGG CAGGCCCGGC AGCGGCTGCG GGACGAGTTC GGCGTGGACC CCTCGCCCAG GCTCGCCGAC ATCCAGCGCC GCATCCTCGC CGACGACCGG GACCTGCTGT CGGCCCGCAC CGACGTGGCC GCCGAGGTCG CGGTGCCCCG GCAGCTGCCC GCCGCCCCGG CCGGGATCAT CGGACGCGAC ACACTGATCA ACGCCGCCGA CGCCGCGGTG GCCCGTGACC ACGACACCGT CGCCTTCGTC GGACCCGGCG GCGCCGGGAA GACCGCGCTA GCGCTGACCT GGGCGCACAA ACTCTCCGCC CGCTTCACCG ACGGCCAGCT GTTCGCCGAC CTGCGCGGCT TCTCCGGCAC CGAACCGGCG CCACCGGCCC GGGTGCTGAC GGGTTTCCTT CGGGCTTTGG GGGTTCCGGC CTCCCGGCTG CCCGCCGGGG AGAGCGAACT GTCGGCGCTG TTCCGCGCCA CCGTCGCCGG GCGCCGGATC CTCATCGTCT TGGACAACGC CGTCGGGCCG AGACAGCTGC GGCCGCTGCT GCCCGGCGAC GACGGCTGCC TGACCGTGGC CACCAGCCGC GACAACCTCA GCGGCCTTGA CCGCGTCCAC GCCATCACCG TCGCCGAACT GTCCAGCGCC GACTCGCAGC GGGTGCTGGC CGAGACGCTG GGCGCCCGCC CCGGACCGGT CGCCGCCACC CTGATCGCGC GGCTCGCCGA GCAGTGCGGG AACCTGCCGC TGGCGTTGCG GCTGGCCGCC AGCCAGCTGT CGGGCGGCTC GGACCACGAA CTGTCCGAAC TGGTCGACGA CCTGGACTCC GGCGATCGGC TCGCGACACT GTCCTATCCG GAGGACTCGC CCGGCGGTGT CGCCGCCGCC ATCGAGACGT CCTACAAGGT CCTGGCGCCT GGGCCCCGGC ACCTGTTCCG GCTGCTGGGG CTGCACCCTT CGGGCACCGC CGACGTCGAA GCGCTGGCGG CCATGGCCGA CGCCGACCTG GCCGAGACCG AACGGATCCT GTCCACACTC GCCGCCGCAC ACCTGGTGGA GCCCACTGGG GACGGCCGCT GGGGCATGCA CGACCTGGTC GCCGAGTACG CCTCCCGGCT CGACGCCCCC GACCGGGAGG CCGCGCTGAA ACGCGGCCTC 
GACTGGTACC TGGCCGCGGT GCTGGCCGCC GAGAACGCCT CCGGCGGTGG CCAGGTCGCC ACCAGGACAC CGGAGGTCGC GGCACCGGTA CCGACCTTCG CCGACCCCGA CGCGGCCCTG GCCTGGCTGG ACTCCCGCTA CCCGACCCTG GTGTCGGCGG TGTCCTTCGC CGACGACGCC GGGTTCCCCG AACACGCGAT CGGCATCGCC GGGTCGCTCA CCGACTACTG CTACAACGGC GGCCGCGTCG AGGACTGGGT GCACCTGCTC CGGGTGGCGC TGTCGGCCGC GCGGCGGCTG GGCGACCCGG TCGTCATCAA CCGCATGCAC GTGCTGCTGG GCAGCGGATA CCGGCGGCTG AACCGCTCCG ACATCGCCAT CGACCACTAT CGACAGGCGA TGGACGCGGC CCGGCAGGCC CGGGACACCT ACCGGCAGGC CGTCACCGGT TTCGCCCGCG CCTACGTCCA CCGCGACCAC GGCGAGTACG AGCAGGCCCG CATCGCCTGC GAGGCCGCGA TACCGCACCT GCGCGAGCAC GGCGACGTCC GCACCGAGGC CAACCTGTTC ACCGACCTGG CGCTGCTGGC GATCCTGCGC GGCGACTACC CCGAGGCCAC CCGCCTCAAC GACACCGCCC GCCGACTCGC CGAGGAGTAC CGGCTGCGTT CGGTCATGCC CTACGTCATC GAGTACTCCG GCCGGATCCT CTACCGGCAG GGCCGTCTGG ACGAGGCCGC GGCCGCCTTC GCGTCGGTGC TGTCCGGGTT CGGCGAGGTG GGGGAGTACG GCGCGGCGCT GATCGCCAGT CAGCTGGCCG TGGTTCAGTC CCGGCTCGGC CACGTCGACG CCGCCCGCGC CCGGCACCTG ACGGCACTGT CGGCCACCGC CGATCCGTCC ACACCGGACG ACGACCGGGC CGCGGTGCTG AGCGACTCGG GACTGTCGTT CCGGCTCGCG GGCCAGCCCG AACAAGCCCT GGAACACCAC CGCGAGGCGC TGTTCGTCGC CGAACGCGGC GGCATCCCGT ACCAGCAGGC TCGCGCCCAC CATGGACTGT GTCTGGCGTT TCGGGCGCTG TCCGACACCG ACCGGGCCGA GGAGCACTGG CGCGAGGCCC TGGACATCCA CACCCGGCTG GGCACCGCCG AGGCCACCGA ACCCGGCCAC CCGATGTACT GA
|
Protein sequence | MPPVGDSPLI RLLGEVSVLV QGKPVSAGTP KQACVLACLA WTPGTPVDTD TIIERVWDGD APTNPRNTLS PYVTRLRSLL SDTGATITGK SGTYTLNIAD TDVDVHAMRA WATQARGLAA TDPARAVALL RAALGLWRGR PLSRVDCRWA DSISVAVEPE HVEVWTQLFE VELSRDNHAR VIGELSEVVA DNPVNENLIG QYLVALYRCG RGIEALECYR QARQRLRDEF GVDPSPRLAD IQRRILADDR DLLSARTDVA AEVAVPRQLP AAPAGIIGRD TLINAADAAV ARDHDTVAFV GPGGAGKTAL ALTWAHKLSA RFTDGQLFAD LRGFSGTEPA PPARVLTGFL RALGVPASRL PAGESELSAL FRATVAGRRI LIVLDNAVGP RQLRPLLPGD DGCLTVATSR DNLSGLDRVH AITVAELSSA DSQRVLAETL GARPGPVAAT LIARLAEQCG NLPLALRLAA SQLSGGSDHE LSELVDDLDS GDRLATLSYP EDSPGGVAAA IETSYKVLAP GPRHLFRLLG LHPSGTADVE ALAAMADADL AETERILSTL AAAHLVEPTG DGRWGMHDLV AEYASRLDAP DREAALKRGL DWYLAAVLAA ENASGGGQVA TRTPEVAAPV PTFADPDAAL AWLDSRYPTL VSAVSFADDA GFPEHAIGIA GSLTDYCYNG GRVEDWVHLL RVALSAARRL GDPVVINRMH VLLGSGYRRL NRSDIAIDHY RQAMDAARQA RDTYRQAVTG FARAYVHRDH GEYEQARIAC EAAIPHLREH GDVRTEANLF TDLALLAILR GDYPEATRLN DTARRLAEEY RLRSVMPYVI EYSGRILYRQ GRLDEAAAAF ASVLSGFGEV GEYGAALIAS QLAVVQSRLG HVDAARARHL TALSATADPS TPDDDRAAVL SDSGLSFRLA GQPEQALEHH REALFVAERG GIPYQQARAH HGLCLAFRAL SDTDRAEEHW REALDIHTRL GTAEATEPGH PMY
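
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a rough sketch using only the Python standard library. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are illustrative, and only short excerpts of the gene sequence are used here.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 12 bases of the gene sequence in this record.
first_bases = "ATGCCTCCCG TTGGTGACTC ACCGCTCATT"
last_bases = "CCGATGTACT GA"
print(translate(first_bases))  # MPPVGDSPLI
print(translate(last_bases))   # PMY
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the Protein sequence and GC content fields; that full comparison is not carried out here.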