Gene Snas_4807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4807 
Symbol 
ID8886014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5102842 
End bp5104002 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content73% 
IMG OID 
Productputative transcriptional regulator, PucR family 
Protein accessionYP_003513541 
Protein GI291302263 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.525155 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTGG CCGCCACGAT TCAGAAGATC GAACGCGCCG CCAGCGCCCT GGCCACCCAG 
AGCGTGGCCC GCATGGACGC CGAACTGCCG TGGTTTCGCG AGCTGCCCGC CGAACAGCGG
GCCATGGTCA CGCTGGTCGC GCAGGCCGGG GTCGGCTCCT TCGTGGAATG GCTGCGCGGC
GACGGTGAGG CACCGGCCGT CGGCGACGAG GTCTTCGACG GCGCCCCGCG CGAGCTGGCC
CGCCTGATCC GGTTGCAGCA CACCGTGGCG CTCATCAAGG TCACCATCGA CGTCGTCGAG
GAGCAGGTGC CGCACCTGGC CGCGCCCGGG GAGGAGGAGG CGCTGCACAT CGCGGTGCTG
AAGTTCTCCC GCGAGGTCGC CTTCGGTGCC GCGCGGGTGT ACGCCCGCAC CGCCGAGACC
CGGGGCGCCT GGGACGCCCG GTTGCAGGCG ATGCTGGTCG ACGGGCTGCT GCGCGGCGAC
GACGGCGACG AGATCGCCGG ACGCGCCGCC GCGCTGGGCT GGGGCGACTC GTCCCCGGTC
GCGGTGGTGG TGGGCCGCTC CCCCGGCGGC GAGGCCGCCG TCATCCTGCA CGCGGTGCAC
CGGGCCACCC GCCGGATGGG CATCGACGTG GTCGCCGGTG TGCACGGCGA ACGGCTGATC
CTGGTGCTGG GCGGAAGCAC CGAACCCGAG GAGGTCGCGG GCAAACTCGT CGGGCAGTTT
GGCGAGGGCC CGATCGTGGT CGGACCGGCG ACGCCGAGCC TGGCCGAGGC GGGCGCCTCG
GCGCGGGCGG CGCTGTCGGG ACACCGGGCG GCACCGGCCT GGCCGGGCGC GCCGCGTCCG
GTCTCGGCGC ATCAACTGCT GGCCGAGCGG GCGCTGGCCG GGGACAACGA GGCCCGACGG
ATCCTGCGCA TCGACGTCTA CAACGCCCTG GAACGCGCCG GTGGTTCGCT GCTGAGCACC
GTGGACACCT TCATCGCCAC CGGCGGCGTC CTGGAGGGCA CCGCGCGGGC GGTGTTCGTG
CACCCCAACA CGATTCGCTA CCGGATGCGC CGGGTCGCCG AAGTGACGGG TTTCTCCCCG
TTTGTCCCCC ATGACGCTTT CACTCTGCAC GTGGCCTTGA CCATAGGTCG CCTGGATCCC
ACCAGTGACG TCATACGTTA G
 
Protein sequence
MELAATIQKI ERAASALATQ SVARMDAELP WFRELPAEQR AMVTLVAQAG VGSFVEWLRG 
DGEAPAVGDE VFDGAPRELA RLIRLQHTVA LIKVTIDVVE EQVPHLAAPG EEEALHIAVL
KFSREVAFGA ARVYARTAET RGAWDARLQA MLVDGLLRGD DGDEIAGRAA ALGWGDSSPV
AVVVGRSPGG EAAVILHAVH RATRRMGIDV VAGVHGERLI LVLGGSTEPE EVAGKLVGQF
GEGPIVVGPA TPSLAEAGAS ARAALSGHRA APAWPGAPRP VSAHQLLAER ALAGDNEARR
ILRIDVYNAL ERAGGSLLST VDTFIATGGV LEGTARAVFV HPNTIRYRMR RVAEVTGFSP
FVPHDAFTLH VALTIGRLDP TSDVIR