Gene Snas_3863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3863 
Symbol 
ID8885063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4124373 
End bp4127318 
Gene Length2946 bp 
Protein Length981 aa 
Translation table11 
GC content69% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003512611 
Protein GI291301333 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.291313 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTTCT CGGTCTTGGG ACCGGTGTCG GTGTCCACGT CGGAGGGACC GGTGACCGTC 
GAGGCACCGA AACAGCGGCT GCTGCTGGCG CACCTCATCA GCCGGACCAA CCACCCGCTG
CCGCCCGACA GCCTCATCGA GCTGCTGTGG AGCCACAATC CGCCGTCCTC GGCGCGCAAG
GCGCTGGCCT GGCACGTCAT GCAGCTGCGC AACGTCCTGG GCGGCAAGGA ACGACTGGCC
TGGCACGGCA ACGGATACGT CCTCAACACC GAACGTGACG AAGTGGACGC CGCCCGCTTC
GAGGAACTGC ACCGGCAGGC CACCGCCGTG CGCGACGCCG ATCCCCGGCG GGCCGCGCAA
CTGCTGAACC AGGCGCTGGG CCTGTGGCGC GGCACCGCGT ACGGGGAACT GCCCGAGTCG
GGTGCGCTGC TGGAGGAGGC CAACCGGCTC AACGAACTGC GGCTGGTGGC GCTGGAGGCG
CGGTGCGACA TCGACCTGGA GCTGGGACGG CACGGCGACA TCGTGCCCGA ACTGACCGGT
CTGGTGGCCG ATCACCCGTT CCGGGAGAAG TTCCGGGCCG CGTTGATGCT GGCGCTGTAC
CGCTGCGGCC GCCAGGCCGA CGCCCTGCGC AGCTACCGGG AGGGCCGGAC GCGGTTCGCC
GAGGAGCTCG GCCTGGAACC GGGACCGAGT CTGCGACAAC TGGAACAGCG CATCCTCAAC
GCGGACCCCG GTCTCGACGC GCCGGTCGCC GAGACCGCGA CGATCGCGTC GGTGGTCCCG
GCGCAGCTGC CCGCCGACCT GCGGTCGTTC ACGGGACGGG AGCCGGAGGT GGCGCGGTTG
CTGGACCTGT CCACCGTGGA CACCAATCGG CCCGGGGCCA TCGTGGTCGG TGCCCTGGAC
GGGATGGCGG GGATCGGCAA GACGGCGCTG GCGGTGCATG TCGCGCAGCG GCTGACGGCC
AGCTACCCCG GTGGGCAGCT GTTCATCGAC CTGCACGGTT TCACCGAGGG CGTCACGCCG
GTGACGCCGG GGCAGGCCCT GGATCGCATG CTGCGGACGC TGGGGGTGGC GTTGCAGCAG
ATCCCGCCCG ACGTCGACGA ACGCGCGGCG CTGTACCGCA GCCTGCTGGC GGACCGGCGG
ATGCTCATCG TCCTGGACAA TGCGGTCAAC GAGGCCCAGG TGACGCCGCT GCTGCCGGGG
GCTTCGGGGA GCCTGGTGCT GATCACCAGT CGTCGGCGGC TGGTGGGGCT GGAGGGGGCG
CAGTATCTGC AACTGGACGT GCTGTCGCCC GACGAGGCCG TGTCGCTGCT GCTGCGACTG
GCCGAGATCT CGCAGCCGTC CGACGCGGAT CGGGAGCTGG CGGCCGAGAT CGTGACGCTG
TGCGGGCGGC TGCCGCTGGC GGTCCGGATA GCGGCGGCGA AGCTGCGGCA CCGTCGGCAC
TGGTCACTGC GGACCGTCCG GGACCGGCTG CTGGATGAAC GCGACCGGCT GCACCAGCTG
GAACTGGGTG AGCGCAGCGT GTCGGCGGCG TTCACGATGT CTTATGAGGA CATCGATGCC
GAGGCGCGGC GAGTCTTCAG GTTGTTGAGC CTGTTCCCCG GTTCCCATTT CGACGTACTG
GTCGCGGCGG CGCTGGCCGA TCGCTCCGTC GCGGTGGTCG AGGAGCTGCT GGACGTCCTC
ATCGAGGCCA ATCTGCTGAC GGTGCTGGGG CCAGGCCGGT TCGCGTTCCA TGACCTGTTG
CGGCGGTTCG CGAACCAGGC CCACGAGGCG GCGGCCGACT ACGCGACTGA GACGGCCGAG
CTGCACAGCA GGTTGCTCAA CTACTACCGC CACGCCGTCT ACTCGGTGGC GACCACCATC
GACCCCGGCA TGGTGAACCT CGCCGAACCC CCACAGACCG CCTTGGCGCT TCCCGAACTG
CCCACCACCG AATCGGCTCA CGACTGGTAC CAGGCCGAGC ACCTCAACGT CTTCGCCGTC
ATCGACCTGG CACCCGACTG GGGGCTGGAC GAACAGCTGT GTCAGCTGGT CAACGCGGTG
AGCACCGTGT CGATCATCTT CTCGCACTAC CAATGGCAGT ACGACATGTG CGAGCGCGGC
CTCCAGGCGG CGCGGCGCTG CGGCGACCGG GACTGCGAGG CCCGGCTGTT GAACCACCAG
GCGCTGGCGC TGCGAAAACT CGACCGCGTC CCCGAGGCCG TCGCACTGCA CGAACAGTCG
CTGCTGCTGC GGCGGGAACT GGGCGACAGA CTTGGCGAGG CCGCGATACT CAACAATCTC
GGTCTCATCC ACCGGCGAGC CGGTGCCCTC GACAAGGCGA TCGCGACCTA CGAACAGGCG
CTGCGGCTGG GTGACGACGC GCGCATGATG TCCATACACG CGCTGCTGCG CAGCAATCTG
GCCGAGTGCT GGATCCGGCT GGAGCGTTTC GACGCGGCGC TGGAGCAGGT GCGGCTGGCC
GAACCGATCA TCACCGAGCT CGGCAGCGAA CGCCAGACCG CCCGGCTCCA GCACTACTAC
GGGTCGGTGT ACTACCACCT CGGCGAGTAC GACCGGGCGC TGCGGCACTT CGCCCGGTCT
CTCGAATACT GCGAGCGGGT CATGGAACCG TACGGGCATG CCAGTGTCCT CAACGGCATG
GCCAATGTGT ACCGGGATCG GGGCGACATT CCCACCGCGG TCGACTTCCA TGAGCGTGCC
CTGCTGCTGT GCCGTTCGAT CAGCGACACC GATCTGGAGG CGCTGATCCT GTACGGCCTG
GGTCGCACCC ACCGCGCGGC CGGTCACCGC GAGCTGGCGT TGAGCAACCT GCGCGCGGCC
GTCGCGGCGG CCACCCAGAC CGGGGACGCC TACCAGCTGG AACACGCGAA CCGGGAGCTG
GCCGACACGC AGGCGGCGGA CCATCCGCAG CGGCTGGAAT CCAGCGACAC CCCGGCCGTC
AACTGA
 
Protein sequence
MQFSVLGPVS VSTSEGPVTV EAPKQRLLLA HLISRTNHPL PPDSLIELLW SHNPPSSARK 
ALAWHVMQLR NVLGGKERLA WHGNGYVLNT ERDEVDAARF EELHRQATAV RDADPRRAAQ
LLNQALGLWR GTAYGELPES GALLEEANRL NELRLVALEA RCDIDLELGR HGDIVPELTG
LVADHPFREK FRAALMLALY RCGRQADALR SYREGRTRFA EELGLEPGPS LRQLEQRILN
ADPGLDAPVA ETATIASVVP AQLPADLRSF TGREPEVARL LDLSTVDTNR PGAIVVGALD
GMAGIGKTAL AVHVAQRLTA SYPGGQLFID LHGFTEGVTP VTPGQALDRM LRTLGVALQQ
IPPDVDERAA LYRSLLADRR MLIVLDNAVN EAQVTPLLPG ASGSLVLITS RRRLVGLEGA
QYLQLDVLSP DEAVSLLLRL AEISQPSDAD RELAAEIVTL CGRLPLAVRI AAAKLRHRRH
WSLRTVRDRL LDERDRLHQL ELGERSVSAA FTMSYEDIDA EARRVFRLLS LFPGSHFDVL
VAAALADRSV AVVEELLDVL IEANLLTVLG PGRFAFHDLL RRFANQAHEA AADYATETAE
LHSRLLNYYR HAVYSVATTI DPGMVNLAEP PQTALALPEL PTTESAHDWY QAEHLNVFAV
IDLAPDWGLD EQLCQLVNAV STVSIIFSHY QWQYDMCERG LQAARRCGDR DCEARLLNHQ
ALALRKLDRV PEAVALHEQS LLLRRELGDR LGEAAILNNL GLIHRRAGAL DKAIATYEQA
LRLGDDARMM SIHALLRSNL AECWIRLERF DAALEQVRLA EPIITELGSE RQTARLQHYY
GSVYYHLGEY DRALRHFARS LEYCERVMEP YGHASVLNGM ANVYRDRGDI PTAVDFHERA
LLLCRSISDT DLEALILYGL GRTHRAAGHR ELALSNLRAA VAAATQTGDA YQLEHANREL
ADTQAADHPQ RLESSDTPAV N