Gene Snas_2843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_2843 
Symbol 
ID8884042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp2991868 
End bp2993073 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content72% 
IMG OID 
Producttranscriptional regulator, XRE family 
Protein accessionYP_003511611 
Protein GI291300333 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.397563 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGGAC CCGATCCCTC GACCGCCACC CAGCTCGACC TCATCGCGGG CCACTTCGGC 
AAACGCCTCA AGTACTGGCG GCTCACCCGG CAGCTGACCC AGGCCGAACT CGCCCGCGAC
CTGAACCTGG ACGGCTCCTA CGTCTCCAAA CTCGAAAGCT CCCGCCGCCG CCCCAGCCTC
GACATCGCCC GACAGTGCGA CGACCTGCTC GACACCGGCG GCGAACTGGC CGACCTGCTC
ACCCTCGTGG CCACCGACCC GGGCCCACCG GTGGCCACGG TCGGGGCTCC GCTGCCCACC
ATCTCCCCCA CCACCGCGCG CACCACGGCC CTGCCCGCCG CGGCCCCCGC CCACGCCACG
GTCTCCCTCA ACCGCCTCGC CGAGGCCTAC GCCGAGGTCG CCGCCACCAT GGGCGGCCAC
CACCTCGGCG AATCCGTCGA ACGCCAGGCC CAGGAGATCA TCGGCCGCCA CATCGGCAGC
CCCGAGTCCC TCTCCGGCGG CCTGCTGCGC ACGGCGGCCC GCTTCGCCCG GCTGGCCGCC
GCCATCCGCC TCGACTCCCT CGACGAGGCC GGAGCCCTCT ACTGGAACGA CTGCGCGGGC
CGCTGGGCCC TCGACGGCGG CGACCCCGCC CTGTCGGCCG AGATGTGCGC CCGCACCGCC
ATCGTCTACG CCCACCGCGA CAACGCCCCC ACCGCTCTCA CCCTCGCCAC CCGCGCCGAA
CAACTGGCCC CCCACGCGCC CACCGCCACC GTCTGGTCCC TGCTCGCCCA GGCCCACGCC
CACGCGGCCT CCGCCGAGCC CGACCAGACC ACCGGCGCAC TCGCCACCGC CCACAAGCTA
CTGACCGAAC TCGACAGTCC ACTGATGGCA AACCCGTCCA CCTACAGCGA CAACCACCTC
TGGCACTGGC ACGCCGGACT CTGCCACCTC ACCCTCGCCC GCCACGACAT CGACCGCACC
ACCAACGCCA ACCGCGCCCT GGACCAACTC CGGCAAGCAC TGTCCGAAGT GTCCGTCTAC
CACACCCGCG AACTGGCCCT CACCCGTCTG GCCCTGGCCC ACGCCTACCT CCACGCCGAC
GACCCCGTCT CGGCCACCGC CGAACTCACC GAAGCCGCCA CCCTGGCCCG CGCCTGCACG
TCACCCCGCC TCCACACCGA GCTCGCCCAG ACCACCACCA TCCTGGCCAC GACCACCCAT
AAATAA
 
Protein sequence
MPGPDPSTAT QLDLIAGHFG KRLKYWRLTR QLTQAELARD LNLDGSYVSK LESSRRRPSL 
DIARQCDDLL DTGGELADLL TLVATDPGPP VATVGAPLPT ISPTTARTTA LPAAAPAHAT
VSLNRLAEAY AEVAATMGGH HLGESVERQA QEIIGRHIGS PESLSGGLLR TAARFARLAA
AIRLDSLDEA GALYWNDCAG RWALDGGDPA LSAEMCARTA IVYAHRDNAP TALTLATRAE
QLAPHAPTAT VWSLLAQAHA HAASAEPDQT TGALATAHKL LTELDSPLMA NPSTYSDNHL
WHWHAGLCHL TLARHDIDRT TNANRALDQL RQALSEVSVY HTRELALTRL ALAHAYLHAD
DPVSATAELT EAATLARACT SPRLHTELAQ TTTILATTTH K