Gene Snas_3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3801 
Symbol 
ID8885001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4052742 
End bp4055723 
Gene Length2982 bp 
Protein Length993 aa 
Translation table11 
GC content72% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003512550 
Protein GI291301272 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000525265 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTCCCG TTGGTGACTC ACCGCTCATT AGGCTGCTCG GAGAGGTATC GGTTCTCGTC 
CAGGGAAAAC CGGTCTCGGC GGGTACGCCG AAGCAGGCGT GTGTGTTGGC GTGTCTGGCC
TGGACGCCGG GTACTCCGGT GGACACTGAC ACGATCATCG AGCGGGTCTG GGACGGTGAC
GCTCCGACCA ATCCGCGGAA CACGTTGTCG CCGTATGTGA CCCGGTTGCG TTCGCTGCTG
TCGGACACCG GTGCGACCAT CACCGGTAAG AGCGGCACCT ACACCCTCAA CATCGCCGAC
ACCGACGTCG ACGTCCACGC CATGCGCGCC TGGGCCACGC AGGCCCGGGG ACTGGCGGCC
ACCGACCCGG CCCGCGCGGT CGCGCTGCTG CGGGCGGCGC TGGGCCTGTG GCGGGGACGA
CCGCTGTCGC GCGTCGACTG CCGCTGGGCG GACTCGATCT CCGTGGCCGT GGAACCCGAG
CACGTCGAGG TGTGGACGCA GCTGTTCGAG GTGGAACTGT CCCGCGACAA CCACGCCCGC
GTCATCGGCG AACTGTCCGA AGTGGTGGCC GACAATCCCG TCAACGAGAA CCTCATCGGG
CAGTACCTGG TGGCGCTGTA CCGCTGCGGC CGTGGCATCG AGGCGCTGGA GTGCTACCGG
CAGGCCCGGC AGCGGCTGCG GGACGAGTTC GGCGTGGACC CCTCGCCCAG GCTCGCCGAC
ATCCAGCGCC GCATCCTCGC CGACGACCGG GACCTGCTGT CGGCCCGCAC CGACGTGGCC
GCCGAGGTCG CGGTGCCCCG GCAGCTGCCC GCCGCCCCGG CCGGGATCAT CGGACGCGAC
ACACTGATCA ACGCCGCCGA CGCCGCGGTG GCCCGTGACC ACGACACCGT CGCCTTCGTC
GGACCCGGCG GCGCCGGGAA GACCGCGCTA GCGCTGACCT GGGCGCACAA ACTCTCCGCC
CGCTTCACCG ACGGCCAGCT GTTCGCCGAC CTGCGCGGCT TCTCCGGCAC CGAACCGGCG
CCACCGGCCC GGGTGCTGAC GGGTTTCCTT CGGGCTTTGG GGGTTCCGGC CTCCCGGCTG
CCCGCCGGGG AGAGCGAACT GTCGGCGCTG TTCCGCGCCA CCGTCGCCGG GCGCCGGATC
CTCATCGTCT TGGACAACGC CGTCGGGCCG AGACAGCTGC GGCCGCTGCT GCCCGGCGAC
GACGGCTGCC TGACCGTGGC CACCAGCCGC GACAACCTCA GCGGCCTTGA CCGCGTCCAC
GCCATCACCG TCGCCGAACT GTCCAGCGCC GACTCGCAGC GGGTGCTGGC CGAGACGCTG
GGCGCCCGCC CCGGACCGGT CGCCGCCACC CTGATCGCGC GGCTCGCCGA GCAGTGCGGG
AACCTGCCGC TGGCGTTGCG GCTGGCCGCC AGCCAGCTGT CGGGCGGCTC GGACCACGAA
CTGTCCGAAC TGGTCGACGA CCTGGACTCC GGCGATCGGC TCGCGACACT GTCCTATCCG
GAGGACTCGC CCGGCGGTGT CGCCGCCGCC ATCGAGACGT CCTACAAGGT CCTGGCGCCT
GGGCCCCGGC ACCTGTTCCG GCTGCTGGGG CTGCACCCTT CGGGCACCGC CGACGTCGAA
GCGCTGGCGG CCATGGCCGA CGCCGACCTG GCCGAGACCG AACGGATCCT GTCCACACTC
GCCGCCGCAC ACCTGGTGGA GCCCACTGGG GACGGCCGCT GGGGCATGCA CGACCTGGTC
GCCGAGTACG CCTCCCGGCT CGACGCCCCC GACCGGGAGG CCGCGCTGAA ACGCGGCCTC
GACTGGTACC TGGCCGCGGT GCTGGCCGCC GAGAACGCCT CCGGCGGTGG CCAGGTCGCC
ACCAGGACAC CGGAGGTCGC GGCACCGGTA CCGACCTTCG CCGACCCCGA CGCGGCCCTG
GCCTGGCTGG ACTCCCGCTA CCCGACCCTG GTGTCGGCGG TGTCCTTCGC CGACGACGCC
GGGTTCCCCG AACACGCGAT CGGCATCGCC GGGTCGCTCA CCGACTACTG CTACAACGGC
GGCCGCGTCG AGGACTGGGT GCACCTGCTC CGGGTGGCGC TGTCGGCCGC GCGGCGGCTG
GGCGACCCGG TCGTCATCAA CCGCATGCAC GTGCTGCTGG GCAGCGGATA CCGGCGGCTG
AACCGCTCCG ACATCGCCAT CGACCACTAT CGACAGGCGA TGGACGCGGC CCGGCAGGCC
CGGGACACCT ACCGGCAGGC CGTCACCGGT TTCGCCCGCG CCTACGTCCA CCGCGACCAC
GGCGAGTACG AGCAGGCCCG CATCGCCTGC GAGGCCGCGA TACCGCACCT GCGCGAGCAC
GGCGACGTCC GCACCGAGGC CAACCTGTTC ACCGACCTGG CGCTGCTGGC GATCCTGCGC
GGCGACTACC CCGAGGCCAC CCGCCTCAAC GACACCGCCC GCCGACTCGC CGAGGAGTAC
CGGCTGCGTT CGGTCATGCC CTACGTCATC GAGTACTCCG GCCGGATCCT CTACCGGCAG
GGCCGTCTGG ACGAGGCCGC GGCCGCCTTC GCGTCGGTGC TGTCCGGGTT CGGCGAGGTG
GGGGAGTACG GCGCGGCGCT GATCGCCAGT CAGCTGGCCG TGGTTCAGTC CCGGCTCGGC
CACGTCGACG CCGCCCGCGC CCGGCACCTG ACGGCACTGT CGGCCACCGC CGATCCGTCC
ACACCGGACG ACGACCGGGC CGCGGTGCTG AGCGACTCGG GACTGTCGTT CCGGCTCGCG
GGCCAGCCCG AACAAGCCCT GGAACACCAC CGCGAGGCGC TGTTCGTCGC CGAACGCGGC
GGCATCCCGT ACCAGCAGGC TCGCGCCCAC CATGGACTGT GTCTGGCGTT TCGGGCGCTG
TCCGACACCG ACCGGGCCGA GGAGCACTGG CGCGAGGCCC TGGACATCCA CACCCGGCTG
GGCACCGCCG AGGCCACCGA ACCCGGCCAC CCGATGTACT GA
 
Protein sequence
MPPVGDSPLI RLLGEVSVLV QGKPVSAGTP KQACVLACLA WTPGTPVDTD TIIERVWDGD 
APTNPRNTLS PYVTRLRSLL SDTGATITGK SGTYTLNIAD TDVDVHAMRA WATQARGLAA
TDPARAVALL RAALGLWRGR PLSRVDCRWA DSISVAVEPE HVEVWTQLFE VELSRDNHAR
VIGELSEVVA DNPVNENLIG QYLVALYRCG RGIEALECYR QARQRLRDEF GVDPSPRLAD
IQRRILADDR DLLSARTDVA AEVAVPRQLP AAPAGIIGRD TLINAADAAV ARDHDTVAFV
GPGGAGKTAL ALTWAHKLSA RFTDGQLFAD LRGFSGTEPA PPARVLTGFL RALGVPASRL
PAGESELSAL FRATVAGRRI LIVLDNAVGP RQLRPLLPGD DGCLTVATSR DNLSGLDRVH
AITVAELSSA DSQRVLAETL GARPGPVAAT LIARLAEQCG NLPLALRLAA SQLSGGSDHE
LSELVDDLDS GDRLATLSYP EDSPGGVAAA IETSYKVLAP GPRHLFRLLG LHPSGTADVE
ALAAMADADL AETERILSTL AAAHLVEPTG DGRWGMHDLV AEYASRLDAP DREAALKRGL
DWYLAAVLAA ENASGGGQVA TRTPEVAAPV PTFADPDAAL AWLDSRYPTL VSAVSFADDA
GFPEHAIGIA GSLTDYCYNG GRVEDWVHLL RVALSAARRL GDPVVINRMH VLLGSGYRRL
NRSDIAIDHY RQAMDAARQA RDTYRQAVTG FARAYVHRDH GEYEQARIAC EAAIPHLREH
GDVRTEANLF TDLALLAILR GDYPEATRLN DTARRLAEEY RLRSVMPYVI EYSGRILYRQ
GRLDEAAAAF ASVLSGFGEV GEYGAALIAS QLAVVQSRLG HVDAARARHL TALSATADPS
TPDDDRAAVL SDSGLSFRLA GQPEQALEHH REALFVAERG GIPYQQARAH HGLCLAFRAL
SDTDRAEEHW REALDIHTRL GTAEATEPGH PMY