Gene Snas_4247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4247 
Symbol 
ID8885448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4541719 
End bp4542951 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content57% 
IMG OID 
Producthelix-turn-helix domain-containing protein 
Protein accessionYP_003512989 
Protein GI291301711 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.710263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCACCT CGCACTGTGA TGTATACGAC TTGCGATCGG TCACATCCTG TCACACAGAA 
TCAAGAAACG CAATGTGCGT CTTTCGTTTG GGCGTGGTTA GATTTTCGTA CTGTGAGTTG
ATGCGGCACA ATGAGCGCAT GACAGCACGT AAGAGCCTGA CCGTTCGCAA GCGTCGCTTG
GTGCGCGCGC TTCGTCAGCT CCGCAAGGAT TCCGGCATCA CTCTTGAGAA AGCGGCCGAA
CATCTGGACA TCAACCACAC CAGCCTGTCG AGGATTGAGA CCGGCGTCGC AGCCGTGAAA
CTGCCGTACG TTGAATCTTT GCTGCGGCTG TACGGGGTGC CCGAAGCCCG ACAAGAAGAG
CTACTACAGC TCACGCGAGA GGCAAAGCAA CGGGGCTGGT GGCAGGCATA CAAGGACATC
CTGTCCAGCG AGTATGCGGA CTTCATTGGG TTTGAGACAG AGGCGAACGA AACCCGAACG
TACGAACTTG ACACCGTCCC CGGTCTCCTA GAGACCGAAG ACTACGCTCG GGCGTTGATT
TCGGCGCAGC TTCCCGGCGC GACGGCGGAA GACATCGAAA AGCGGGTGAA GTTGAGGGCG
AGCCGTCAAG ATCGGCTCAA AGAAGACCCG AAACTCAGCG TTTGGGCCAT CCTCGGAGAA
GCTGCGCTGC GATATCAGGT CGGCGGAATG AAAGTTCTAC GGGCACAGCT TGAATATCTA
CTGCAACTTC AGCGCGAGCC CAACATCACG ATTCAAGTTC TCCCGTTTTC CGCAGGTGCG
CACCCTGGGA TGGCTGGTCC GTTCGTAATC CTCGGGTTTG ACGACGATCC TGACATCGTT
TACCTCGAAG GGCTCACCAG CGCGCTTTAT CTCGAAGATC TTGGCGAGTT GGAGCGCTAT
AAGATGGTTT TCGAGCGTCT TCTCGCCGAG GCTTTGAGCC CTGCGGCGTC CGACAGGCTC
ATTCGAGAGG CATCAAAAGA GTTATGCCAC TTTGCAAACA ACGGAAGGAG GCGCGGGATG
GCCGCGCAAG GTGACGACCT GGCCCGTGCC CGATGGCGCA AGGGCAGACG AACCCAAGCC
AACGGCAACT GCGTGGAGGT TGCCCTAGTC GAGTCAGTCT ACGTGCGTGA CTCCAAGCTC
GACACTACAG GAACGTTCCC CACACTCTCA GTCTCAAGTA CCGAGTGGAA GAACTTTCTT
CTTGCAATCG CCAACAACGA CAAGACCGGC TAA
 
Protein sequence
MVTSHCDVYD LRSVTSCHTE SRNAMCVFRL GVVRFSYCEL MRHNERMTAR KSLTVRKRRL 
VRALRQLRKD SGITLEKAAE HLDINHTSLS RIETGVAAVK LPYVESLLRL YGVPEARQEE
LLQLTREAKQ RGWWQAYKDI LSSEYADFIG FETEANETRT YELDTVPGLL ETEDYARALI
SAQLPGATAE DIEKRVKLRA SRQDRLKEDP KLSVWAILGE AALRYQVGGM KVLRAQLEYL
LQLQREPNIT IQVLPFSAGA HPGMAGPFVI LGFDDDPDIV YLEGLTSALY LEDLGELERY
KMVFERLLAE ALSPAASDRL IREASKELCH FANNGRRRGM AAQGDDLARA RWRKGRRTQA
NGNCVEVALV ESVYVRDSKL DTTGTFPTLS VSSTEWKNFL LAIANNDKTG