Gene Snas_3884 details


Gene Information

Locus tag: Snas_3884
Symbol:
ID: 8885084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4149406
End bp: 4152378
Gene Length: 2973 bp
Protein Length: 990 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: transcriptional regulator, SARP family
Protein accession: YP_003512632
Protein GI: 291301354
COG category:
COG ID:
TIGRFAM ID:
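
The coordinates and lengths above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4149406, 4152378)
print(length)                     # 2973
print(protein_length_aa(length))  # 990
```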


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.039717
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.641225
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACTTTC GCATCCTCGG CTCCCTCGAA GTGCGGCACG ACGGTGCCGT GATCCCGGTT 
CGGGGGCGTC AGCAACCCAA GGTGCTGGCC ATGCTGCTGC TGCAGGCCGG TCACGTCGTG
TCCGTCGACA GGCTCGTGGA CGCGCTGTGG GACGACGATC CGCCCGCCAC CGCCCGCCGC
CAGGTGCAGA ACACCGTCGC GGCGCTGCGG CGCACCCTGT CGGTGGCCGA GGGCCCGCTC
ATCACCGCCG TCGGCGAGGG CTACCGGCTG TCCACCGCCC ACCTCGACTC GCTCCAGTTC
AACGACTACG TCCGGCAGGC AGCTGCCGCC GCCGAGAACA ACCGGCTCGC CGAGGCCCAC
ACCCGACTGT GCCAGGCCCT GGAACTGTGG CGCGGCGAGG TGCTGACCGG CATGAACGGC
CGGGTGCTGC GCGCCTTCGC CGAGCAGCTG GAGGAGGCGC GGCTGCGGGC CTTCGAGACC
CGCATGGACA TCGAGCTGCG GCTGGGACAG CCCGGCCGCA TCGTCGAGGA GGCGCGGCGG
CTGCTGACCG AGCACCCGTA CCGGCAGGAC ATGGCCGCGC TGCTGATGCG GGCGCTGCAC
CAGTGCGGAC GCGGCACCGA GGCGCTGGAG GTGTACGCGA CGCTGCGGTC CCGGCTGGCC
GAGGAACTGG GCATCGACCC GACCCGGGCG CTGCGCGACC TGCACCTGGA GATCCTGCGC
GCCGACGGCG AGGGCGCCCG GACCTCCGCT CCGGCCCGGC AGTCCGTCGG GGAGGTCCCG
GCGCAGCTGC CCGCCGACAT CGCCGGATTC ACCGGCCGCG CCTCGCAACT GGCGGCGCTG
GACGCCATGC TGGACCAGGC CGACGGCGCC TCGGTGCTGG CCACCGTCGC CGGTGCCGGC
GGCATCGGCA AGACCGCGCT GGCGGTGCAC TGGGCCCGGC TGCGCGCCGA CCGCTTCCCC
GACGGGCAGC TGTTCGTCAA CCTGCGCGGC TTCGACCACA GTGCCCCGTT GTCGGCCCAC
GACGTGCTGA CCCGGTTCCT GCGCGGCTTC GGGTTCAACT CCGAGGCGAT ACCGTCCGAC
CTGGACGAAG CCGCCGCGCT GTACCGCACC TACCTGCACG GCAAACGGGT CCTGATCCTG
CTGGACAACG CCGCCCGGGT CGACCAGGTC CGGCCGCTGC TGCCCGCCGG TCCCGGCTGC
TTCGCCCTGG TCACCAGCCG CGACTCACTG GCGGGCCTGA CAGCCCTGGA CGGCGCCCGG
CGCGTCGAGG TCGACACCCT GGGGCCGCGC GAGTCGCTGA GGCTGCTGGC CGACCTGATC
GGACAGTCCC GTCTGGACGC CGAAGTCGAG GCCGCCACCG CCATCACCGA ACTGTGCGGA
CGACTGCCGC TGGCGCTGCG CGTCGTCGGC GCCAACCTGG CCGCCCGCCC CTCCGAACGG
CTCGCCGAGG TCGCCGCCGA ACTGGCCGGA GCCGACCGCT TGGAACGGAT GGTGGTGCCG
GGCGACACCC GCGCCGCCGT CGCCGACTGC ATCACGCTGT CGCTGCCGTC CATCGACGAG
AACACCCGGC GGTTCTTCCT GCACCTGGGA CTGGTGCCCG GCACCGAGAT CTCGGCGAGC
ATGGCCGCCG CGGTCGTTGA CGGCACCGAG GCCGAGGCCC GCAGGCTGCT GGGTCGGCTC
GCCCACGCGC ACCTCATCGA CCCGCAGAGC GACGAGCACT GGCGCTTCCA CGACCTGGTG
CGGCTGTACG CCCACGCCCG GGCCGCCGAC GACCTGAAAC CTGCCGACCG CGACGCGGCG
ATGGAGCGGC TGCTGGACTG GTACGCCGAC TCCCAGACCA AACTGCGGCA CGAGGACCGG
GTGGCCACGG TGCTGGCGCT GGTCGACCAT CCCCGGGTGT GGAAGGCCGC CACCAACTTC
CACGCCTCGG TCCACGACGG CTACGACCCC GACGAGATCC GCCGGGTCGT CAGGATCGCG
CTCGGCGTCG CCGAAGCCCA CGACGACGCC GCCGGTCAGG CGTGGATGCA CAACCTGGTC
GCGGGCACCT ACTGGGCGGC GCGACGGTTG CCCGAGGCCG TGGCGGCCGG GGAGCTCGCG
TTGGAGACGG CGCGGCGCAG CGGCGACCCG CTCCTGATCG CGCGCCACCT CAACAACATG
GCGTCGTTCC GTTCGCTGAG CGGCGACAAC CTGGCCGCCC GGAGGATCCT CGACGAGTCG
CTCAAGATCG CCGAGGAGTC CGCCGACCCG TGGGCCATCT CCGCCCGGCT CGACAGCCTC
GGCGAACTGT GCATCCATTT GGGACAATAC GCCGAAGCCG AGACCCACCT GCGGCGATCG
CTGGCGGTAC GTCCGCCGCA GCCGCCGGGC AAACGCTGGC CCAAGACCCC GTCCAAACTG
GCCCACCTGT GCCTGGACAC CGGCCGCTAC ACCGAGGGCC TGGAATACGT CGAACTCATC
CTGGCCGAGG CGATGTCCCG GCACCATCCG CGGGCGCTGT GCCTGCGGGC ATCGCTGCGG
CTGGCCATGG GCGACCTGGA CGCCGCCTAC GCCGACTTCA CCGAGTCCTT CGCGGTGGAA
CGCCAGAACC GGTACGTGGG TGAGGCCGCG GACCTGCTGA TCCCGTACGC GCACTGCCTC
AGCGAACGCG GCGAGGCGCA GTCCGCCCTC AAACACGCCC GCGAATGCCT GGAGTGGGGC
CGATCCAGCG GCATCCGCCG AGACGAGGCC GCCGCCAGTC TGCTGCTGTC CACGATCCAC
GCCCGGCAGG AGGACTACGC CACCGCGGCG ACCTTCGCGC GCGAGGCGTG CCGATTGTTC
GCGTCGATGT CCGAACCGCT GCGGCACGGA CGCTCCCTTG TGGCGCTGGC CCGGGCGCTG
ACGGGTCTGG GGGTGCCGGA AGCCGCCGAG CATCGGGCGG CGGCCGAGGC GATCTTCGAA
CGGCTCGGCG TCACCGCGTT CGAGACCCGA TAG
 
Protein sequence
MDFRILGSLE VRHDGAVIPV RGRQQPKVLA MLLLQAGHVV SVDRLVDALW DDDPPATARR 
QVQNTVAALR RTLSVAEGPL ITAVGEGYRL STAHLDSLQF NDYVRQAAAA AENNRLAEAH
TRLCQALELW RGEVLTGMNG RVLRAFAEQL EEARLRAFET RMDIELRLGQ PGRIVEEARR
LLTEHPYRQD MAALLMRALH QCGRGTEALE VYATLRSRLA EELGIDPTRA LRDLHLEILR
ADGEGARTSA PARQSVGEVP AQLPADIAGF TGRASQLAAL DAMLDQADGA SVLATVAGAG
GIGKTALAVH WARLRADRFP DGQLFVNLRG FDHSAPLSAH DVLTRFLRGF GFNSEAIPSD
LDEAAALYRT YLHGKRVLIL LDNAARVDQV RPLLPAGPGC FALVTSRDSL AGLTALDGAR
RVEVDTLGPR ESLRLLADLI GQSRLDAEVE AATAITELCG RLPLALRVVG ANLAARPSER
LAEVAAELAG ADRLERMVVP GDTRAAVADC ITLSLPSIDE NTRRFFLHLG LVPGTEISAS
MAAAVVDGTE AEARRLLGRL AHAHLIDPQS DEHWRFHDLV RLYAHARAAD DLKPADRDAA
MERLLDWYAD SQTKLRHEDR VATVLALVDH PRVWKAATNF HASVHDGYDP DEIRRVVRIA
LGVAEAHDDA AGQAWMHNLV AGTYWAARRL PEAVAAGELA LETARRSGDP LLIARHLNNM
ASFRSLSGDN LAARRILDES LKIAEESADP WAISARLDSL GELCIHLGQY AEAETHLRRS
LAVRPPQPPG KRWPKTPSKL AHLCLDTGRY TEGLEYVELI LAEAMSRHHP RALCLRASLR
LAMGDLDAAY ADFTESFAVE RQNRYVGEAA DLLIPYAHCL SERGEAQSAL KHARECLEWG
RSSGIRRDEA AASLLLSTIH ARQEDYATAA TFAREACRLF ASMSEPLRHG RSLVALARAL
TGLGVPEAAE HRAAAEAIFE RLGVTAFETR
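
The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG can act as a start codon, and an initiating codon is read as methionine. The following is a minimal sketch of that rule applied to the first 30 bases of the gene sequence. It is not the tool that produced this record. The `translate` function is hypothetical, and the start-codon set is restricted to the common bacterial initiators ATG, GTG and TTG, which is a simplifying assumption.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only the common bacterial initiators are treated as starts.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an initiating start codon as M."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":       # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


first_30 = "GTGGACTTTC GCATCCTCGG CTCCCTCGAA"
print(translate(first_30))  # MDFRILGSLE
```

The result matches the first ten residues of the protein sequence listed above.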