Gene Snas_5386 details

Gene Information

Locus tag: Snas_5386
Symbol:
ID: 8886595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 5717865
End bp: 5718932
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: transcriptional regulator, ArsR family
Protein accession: YP_003514111
Protein GI: 291302833
COG category:
COG ID:
TIGRFAM ID:
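
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above; variable names are illustrative only.
start_bp = 5717865
end_bp = 5718932

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1068 355
```

Under these assumptions the computed values agree with the listed gene length (1068 bp) and protein length (355 aa).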


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGAAG ATGACGGCGC CCCGTTGGAC AAGGCCGACG CCGACGAGTA CGCCAGCTGG 
TTCAAGGCGT TGTCGGACGG CACCCGCATC CGGATAGTGT CGCTGTTGGC CAAGCGGCGC
GAACCGATGA AGGTCGGCGC CATCGTGGAC GCGGTGGGCG TGGGTCAGTC GACGGTGTCG
CACCACCTGA AGATCCTGGC CGAGGTCGGT TTCGTGCTGG CCCAGCACAG TGGAACATCC
ACCTTGTACC AAGTCAACGA GTCGTGCGTG ACGGCGTTCC CGTCCGCCGC CGATGTCGTC
ATGGGGCGTC CGGCTCCCCC TCGCACCTGC TTAGGAGTAT CCATGTCCCA CAAGAACATC
ATCGACAAAT ACTCGTCCCT GGCCCGCACC GCGCTCGACG GCGCGCAGGT GACCGACTGC
GAACCGCGTG CTTTCGCCGA CGGGAAGTTC GGCGCGGGCG GATACGTGAA GCTGGACGAC
CTTCCCGAAG GCGCGGCCCG CGCCAGCCTC GGCTGCGGCG ACCCGGTGGC GGTGGCCGAC
CTGCGCCCCG GTGACATCGT GCTGGACCTG GGCTCGGGCG GCGGCATCGA CGTACTGCTG
TCAGCCCGCC GGGTCTCCCC GGGCGGCAAG GCATACGGAC TCGACGCCAG CGCCGACATG
GTGGCCCTGG CCCGCCGCCA CGCCGCCGAA GCCGGTGCCG ACAATGTGGA GTTCCTGCTC
GGCGACATCG AGAACATCCC GCTTCCCGAC GCCAGCGTCG ACGCGGTGAT CTCCAACTGT
GCGCTGTGCC TGTCCAGCGA CAAGACCGCG ACCCTCACCG AAGCGTTCCG GGTGCTCAAA
CCCGCGGGCC GCTTCGGCAT CAGCGACGTC GTCGCCCACG GCGAGGCCGA CCCGGCCGAA
CGCCAACGCG TCGAAGCCCA GATCGGATGC GCTGTCGGCA CGCTGACCAC GCACCAGTAC
CGCGACATGT TGTCCGCCAT CGGCTTCGGC GACATCGACA TCACGCTCAC CGCCGACCAC
GGCGCCGGGA TTCATTCCGC GATCGTCCAG GCAATCAAAC CCGGCTAG
 
Protein sequence
MKEDDGAPLD KADADEYASW FKALSDGTRI RIVSLLAKRR EPMKVGAIVD AVGVGQSTVS 
HHLKILAEVG FVLAQHSGTS TLYQVNESCV TAFPSAADVV MGRPAPPRTC LGVSMSHKNI
IDKYSSLART ALDGAQVTDC EPRAFADGKF GAGGYVKLDD LPEGAARASL GCGDPVAVAD
LRPGDIVLDL GSGGGIDVLL SARRVSPGGK AYGLDASADM VALARRHAAE AGADNVEFLL
GDIENIPLPD ASVDAVISNC ALCLSSDKTA TLTEAFRVLK PAGRFGISDV VAHGEADPAE
RQRVEAQIGC AVGTLTTHQY RDMLSAIGFG DIDITLTADH GAGIHSAIVQ AIKPG
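
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. This is not the tool that produced the record. It is a minimal example that assumes the sequence is given 5' to 3' on the coding strand, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are allowed). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases and last 18 bases of the gene sequence listed above.
first_block = "ATGAAAGAAG ATGACGGCGC CCCGTTGGAC AAGGCCGACG CCGACGAGTA CGCCAGCTGG"
last_codons = "GCAATCAAAC CCGGCTAG"

print(translate(first_block))  # prints: MKEDDGAPLDKADADEYASW
print(translate(last_codons))  # prints: AIKPG
```

The two printed fragments match the beginning and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.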