Gene Snas_3782 details


Gene Information

Locus tag: Snas_3782
Symbol:
ID: 8884982
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4037007
End bp: 4039955
Gene Length: 2949 bp
Protein Length: 982 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: transcriptional regulator, SARP family
Protein accession: YP_003512532
Protein GI: 291301254
COG category:
COG ID:
TIGRFAM ID:
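
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the CDS ends in a single stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS of the given length, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4037007, 4039955)
print(length)                     # 2949
print(protein_length_aa(length))  # 982
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields of the record.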


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.519842
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.00738187
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTTACTC AAGGCGATGA ACCGCTGATC CGGTTGCTCG GTGACGTGTC GATCGTCGTC 
AACGGTGAGC CGGTCTCGGC GGGTACGCCG AAGCAGGCGT GTGTGTTGGC GTGTCTGGCC
TGGACGCCGG GTACGCCGGT GGACACTGAC ACGATCATCG AGCGGGTCTG GGACGGTGAC
GCCCCCGCCA ATCCGCGTAA TACCTTGTCG CCGTATGTGA CCCGGCTGCG TTCCCTCCTG
TCCGACACGG GCGCGACCAT CACCGGTAAG AGCGGTACCT ACACCCTCAA CGTCGCCGAC
ACCGACGTCG ACGTCCACGC CATGCGAACC CTCGCCGCCC GCGCCCGCGA CGTCCCCGCC
CACCACCCCG ACGCGATCAG CCTGCGCCGC AAGGCATTGC GGCTGTGGCG CGGCTCGCCG
CTGTCGCGCG TCGACTGCCG CTGGGCCGAC GCGATCTCGG TGACCCTCGA ACCCGAACAC
GTCAGCCTGT GGACCGACCT GTTCGACGCG GAACTGGCCC AGGGCAACCA CGCCACCATC
CTGGGAGAAC TGTCGCAGCT TCACAACCGC CATCCCGACA ACGAGAACCT CATCGGACAG
TACATGGTGG CGCTGTACCG GTGCGGACGC GGCGTGGAGG CCCTGGAGAC GTTCCGGCGC
ACCGATTCCC GGTTCCGCCG CGAGTTCGGC ATCGACGCGT CGCGGCGGCT GGCCGAACTA
CAGCGCCGCA TCCTCGCCGA CGATCCGTCC CTGTCGGACA CCGCCGCCGC CAAGACCAGC
ACCGAACCCG TCCCGGCGCA GCTGCCCGAG CCGCCGCCGG GCTTCGTGGG CCGCAGTGCC
GAACTGCGCG CCGCCGACGA CCAGGTGGCG CGGGGACGGC GCGCGGTGGC GTTCGTCGGT
CCCGGCGGCA TGGGCAAGAC CTCGCTGGCC CTGTGGTGGG CGCATCGCGT CGCCGCCGAT
TTCGCCGACG GCCAGCTGTT CGCCGACCTG CGCGGCTACT CCGGCGAGGA ACCGGTACCG
ACATCCCGGA TCCTGGCCGG ATTCCTGCGA GCCCTTGGCC ACAACGACTC CGACCTGCCC
ACAGGGGAGT CGGAGCTGGC GGGCATGTAC CGCACGGCGC TGGCCAAGAA GAACGTCCTC
ATCGTGCTGG ACAACGCCGC CGGCCCCGCG CAGGTCCGGC CACTGCTGCC CGGCGACGGC
AACTGCCTGG CCGTCCTCAC CAGCCGCGAC GACCTGCGCG GCCTCAAAGT CGACCACGAC
GTGGCCACCA TCGGCGTCGG CGAACTGTCC ACACCCGACG CGGTGGCGAT CCTGTCGGCC
CACGTCACCG CCGCCCCCGA GGCCCGCGAC CAGCTGGAGC GACTGGCGGA ACTGTGCGGC
CACCTGCCGC TGGCGATCCG GCTGGCCGCC AGTCGCCTGC CCTCCGGTTC GGCCGAGGAA
CTGTCCGCAC TGGTCACCGA CCTGGAGTCC GGCGACCGGC TGGCGACCCT GTCGCGGCCC
GGCGAGACCG TCGGCGGCCT GGCCGCCACC ATCGAGTCCT CCTACCGCCG ACTGGATCCC
GCCGCCCGCG AGGTCTTCGA ACTGCTGGGC GCCCACCCCA GCGGCGAGGC CGACGCGGCG
GCACTGGCCA ACGCCGTGGG CCGTCCGGTC GCGCAGGTGA ACCGCAGCCT GCGGCGCCTG
GAGGCCGCCC ACCTCGCGCA CCGGGTCGAG TCCCGTTGGG GCGTCCACGA TCTGGTCAAG
GAGTTCGCGG CCCGCGCGGC CGGTGACGTC GGTGACCGGC TGGGCCGCCT GTACGACTGG
TACGCGATGG GCGTCCTGGG CGCCGACGGC CACATCTACG GTCCCCGGCC GCAACTGCGG
GTCGACACGA CCGTCCCGCC GCCGGAGTTC GACGGCGCGG GCGAGGCCGA CGCCTGGGTG
GAGCCGCGCG TCGACGTGTT CCTGGCCGCG ATCGACCAGG CCGCCGCGCG GCACCGCGAC
CAGGCGCTGC GACTGGTGAC AGGACTGTGG CGGTACCTGT GGCGGCGCGG CATGTACGAC
GCCTGGATCA CCGCCCAGCA CACCGTTCTG GACGCGTTCG GCGAGGACGC CGACGCGCGG
GTGCGCTGCC AGCTGCTGAT GGGCCTCGGC AACGCCTACA ACTGCGCCGG ACGGCACGAC
AAGGGCGCCG GGCACATCCG CGCCGCCTAC GACCTGGCCC AGGACCTGGA CGTGGAGGAA
CGCGCCAAGA CCGCCGGGGC CCTGGGGGTG CTGTTGAGCG ACTGCGGTTC CAGCCGCGAG
GCACGGTCCT ATCTGGAACA GGCCCGGGAC CTGTACGCCA AGGCCGCGGT GCCCGGCCGG
GTGGCGGTGT CGCAGTACGA ACTGGGACGG CTGGACTACC GCGACGAGGT CTTCGCCGAC
GCCCTGGACC GGTTCGCCGA GGCGGTGGCG GTGTGGGAGT CGCTGGCCCC GGGTTCGGTC
GCGATCGGGC TGACCAGCCT GGGCCGCACC CAGGCGCGAC TGGGACGGCT GGCCGAGGCC
GCGACCACTC TGGAGCGCGC GGTGGCGGCG GCCCGCGACC TGTCGCACCC GTCGGTGGAG
AGCTACGCGC TGAGCATCCT GGCGATCGTG CTGCACCGCA ACGGCAGTAC CCGGGCGGGC
TGGCGACGGC ACCGGGAGGC GATCGCGTTG ACGCCCCGGG TCACCCACGC CGACCTGCGC
GTCGAGATCC ACAACTTCGC CGGGGTGTTC CGCGCCCAGG CCGAGCAGCC CGAGGCCGCG
ATCGAGTACC ACCACGTCGC GCTGGCACTG GCCGAGGAGG CCGACGCCGG ATACGAGAAG
GCCCGGGCCC ACCAGGGACT CGCCGACGCC TACGCCGCGC GGGACGACGC CGAGGCGGCG
GCACACCGGC GGGCCGCGGC GGCCGGATTC GCGCTGGCAC AGACCCCGCC GCCGAGCGAC
CTGCTGTGA
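
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed, for comparison with the GC content field of the record. The helper names `clean_sequence` and `gc_fraction` are illustrative, and only the first row of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence shown above, used as sample input.
first_row = "ATGCTTACTC AAGGCGATGA ACCGCTGATC CGGTTGCTCG GTGACGTGTC GATCGTCGTC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"GC fraction of first row: {gc_fraction(seq):.2f}")
```

Passing the full gene sequence instead of the first row would give a value to compare against the reported GC content.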
 
Protein sequence
MLTQGDEPLI RLLGDVSIVV NGEPVSAGTP KQACVLACLA WTPGTPVDTD TIIERVWDGD 
APANPRNTLS PYVTRLRSLL SDTGATITGK SGTYTLNVAD TDVDVHAMRT LAARARDVPA
HHPDAISLRR KALRLWRGSP LSRVDCRWAD AISVTLEPEH VSLWTDLFDA ELAQGNHATI
LGELSQLHNR HPDNENLIGQ YMVALYRCGR GVEALETFRR TDSRFRREFG IDASRRLAEL
QRRILADDPS LSDTAAAKTS TEPVPAQLPE PPPGFVGRSA ELRAADDQVA RGRRAVAFVG
PGGMGKTSLA LWWAHRVAAD FADGQLFADL RGYSGEEPVP TSRILAGFLR ALGHNDSDLP
TGESELAGMY RTALAKKNVL IVLDNAAGPA QVRPLLPGDG NCLAVLTSRD DLRGLKVDHD
VATIGVGELS TPDAVAILSA HVTAAPEARD QLERLAELCG HLPLAIRLAA SRLPSGSAEE
LSALVTDLES GDRLATLSRP GETVGGLAAT IESSYRRLDP AAREVFELLG AHPSGEADAA
ALANAVGRPV AQVNRSLRRL EAAHLAHRVE SRWGVHDLVK EFAARAAGDV GDRLGRLYDW
YAMGVLGADG HIYGPRPQLR VDTTVPPPEF DGAGEADAWV EPRVDVFLAA IDQAAARHRD
QALRLVTGLW RYLWRRGMYD AWITAQHTVL DAFGEDADAR VRCQLLMGLG NAYNCAGRHD
KGAGHIRAAY DLAQDLDVEE RAKTAGALGV LLSDCGSSRE ARSYLEQARD LYAKAAVPGR
VAVSQYELGR LDYRDEVFAD ALDRFAEAVA VWESLAPGSV AIGLTSLGRT QARLGRLAEA
ATTLERAVAA ARDLSHPSVE SYALSILAIV LHRNGSTRAG WRRHREAIAL TPRVTHADLR
VEIHNFAGVF RAQAEQPEAA IEYHHVALAL AEEADAGYEK ARAHQGLADA YAARDDAEAA
AHRRAAAAGF ALAQTPPPSD LL
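
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a hedged illustration, the sketch below builds a codon table using the standard codon assignments, which table 11 shares (table 11 differs from the standard code in which codons may serve as starts, and that does not matter here because this gene begins with ATG). The `translate` function is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned, in-frame DNA sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCTTACTCAAGGCGATGAACCGCTGATC"))  # MLTQGDEPLI
print(translate("CTGCTGTGA"))                       # LL
```

The two outputs match the first ten and the last two residues of the protein sequence listed above.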