Gene Snas_5586 details


Gene Information

Locus tag: Snas_5586
Symbol:
ID: 8886801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 5935780
End bp: 5938191
Gene Length: 2412 bp
Protein Length: 803 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003514309
Protein GI: 291303031
COG category:
COG ID:
TIGRFAM ID:
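
The coordinates and lengths listed above agree with each other, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5935780, 5938191)
print(length_bp)                  # 2412
print(protein_length(length_bp))  # 803
```

Both values match the Gene Length and Protein Length fields of this record.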


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATCCGG TTTCGTCGTT CTTCGTGACT CGCTGGGCGA CCACGGCGAC GGACCGGGCG 
GCCCGGCGGT TCAGCGAGGC GATCCGGGGC ACCGAGGTGC AGCGCGCGCT GGTCACCGCC
GTCAAGAACG CCATCCCCAA GGCCGCCGCC CGGCTGCACC CCGACGACGA CTCGCTCGCC
GGACACGTCG CGGACTGCCT GTGGGAACGC GATTCCCACG CATTGCCGCT GGTGGACGGA
ACCGCCCTGA TCGATCTGGC CGACGCCGTG GCGCCCTGGG TCGACGAGGT GTACACGCCC
GTCGGCGAGA CCCCCGATGC CGGACCCGAA CCGTCGCTGT TGGCGCGGCT GCTGCGCGAG
GAGATCATCG CGGCGGTGCG CGCCGAGGCC ACGGCCGGGG ATCGGGTGCT GCACGCGCTG
TGGAGCGACT ACCAGAACGA GTCGATCCTG CGCCGGGTCG CGCGGCCCCG GGCCACCGAA
CCGGGTCGGC TGGACTGGGG ACGGCTGCCG CTGCCGGGCA CCTTCGTCGG GCGGCGCGAA
CAGCTGGAGA CCCTGGCGGA CATGCGTTCC GAGTCCGGGC TCGGCCTGAT CGTGTCGGTG
GGTGGATTCG GCGGCATCGG CAAGACCGCA CTGGCGTCCT GGTTCGCGGC CAGTGTCCGC
TGCTCCTACC CCGACGGCTG CGTGTTCGTC GACTTCCAGT CGTACCCGTC GCCGGACCAG
GCCGTGTCGT CGTACGCGGC GCTGGGGATG TTGCTGGAAC ATCTCTTCGG GCTTGAGGCA
GCGACGGTGG CCAGAATGGA CCTGGCCGCG CGCGGCGCCG AATGGCAGCG ACGCGTCGCC
GGCAAACGCA TGATCTTCGT GTGGGACAAC GTGTCCAGCA TCGACCAGGT GGAGCCGCTG
CTGGTCCGCG ATCCCGACTG TCTGACGCTC ATCACCTCCC GGGAGAGGTT CCACTGTGCC
GGGTCGCGAA CCCTGGACCT GGACGTGCTC GACCACGACG CGGCCGTGGC GATGTTCGTC
GCGGTCGCCG GTCCCCGGCT GGCGGCCGAC GAGGCGGCGG TGTCGCGAGT GGTGACGGCC
TGCGGACACA TGCCCGTCCT CATCGGATTG AAGGCCGCCG ACATCGCCTG CGGCACCGGC
AGTCTGGACC GGATCGAAGT CCAGCTGAAC AACCTGCCGT CCGCGCACAC CAAGGCCGGA
CTGTACGCCC GCATCGACAG CTCGTACGCG GCCCTCACCG CCGAACAGCA AGCGGCGTAC
CGGTTGCTGG GCAAACACCC GGGCCGCTAC CTCACCGTCG GCACCACCAC GATCGTCCTG
TCGCGACTGA TCGGCCGCTC GGTGACGGTC GCCGAAGCGG TGGCGTTGCT GGACGGCCTG
GTCGCGCACC GGCTCGCCGA GCCGATGGTC ACCGACCCAC CCGCCAGCAT CGCCGAACAG
CTGGCGTACA CCGCCCACGA CATCCTGCTC GACCACGCCG CCGGTCTCCC CGCCGCGACC
GGAGAACCGG AAATACTGTC CACAGTTTGC GACTATTACG CCGACCGGCT GGCGCACTAC
GACTTCGTCG GCGACCTGGC GTGGTTCCGT ACCGAACGGT CCGCCCTCGT CGCCGCGCTC
ACCAGCCCGG CGGCGCGACC GCACGCCACC CTCAACCTGG CCCAGCTGAT GCTGACCAAC
GACAACATCG GCCTGGCCAT CGACATCCTG CGTGAACTGC TGGACGGATT CGTCGCGACC
GGCGACAGCC GCGGCGAGGT CAACACCCTG CAGACACTGG GACAGGCGGT GCTCGTGACC
GGTGACCATG AGGGCGCCCG CGACCTGTTC ACCAAGGCCG GACACCTCGC CGAGGAACAC
GGCTTTCGGG ACTGCGCGGC CCACGCCTAT CTCGGACTGG CACAGGTCAA GGCGCTCAAC
GACTCCCCGG TCGGCATCGC GGACATGTTC GAGCGGGCGC TGGACATCTT CGTCGACCTG
GACGACGTCA CCAACGCGAC CAACGCGCTG GCGGGAATGG CCCAGGTGGC GATGCTGCAA
CAGGACTTCG ACGCGGCCCG CTCCATCTTC GTGGAGATCG AGGACACCTC CGAGGTCCGG
GCGTACGGCA TGGGCGTCGC CCTGGGGGTG CTCGGCCAGG CCCACGTCGC CTACCTCGAC
GGCGACAACG ACGAGGCGAT CCGGCTGTAC CAGCGGGGCG AAGAAGCGTC CGAGTCCGTC
GGCTACCGGT TCGGCGTGGG CATGGCCCGG GAGTGCCTCG GCGAGATCGC CGCCGAGACC
GGCGACGTGG AGGGCGCGAT CGCGCACTTC CGCGGCGCGC TGACAGTGTA CGTCGAGATC
GGTTCACCGT CGCAGCACGA AGTGCGCGAA GCCCTGCGGA ACCTGGGGGT CCCGGCCGAA
CCACCGGACT GA
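
The sequence is printed in space-separated groups of ten bases, so it has to be joined before any statistic such as GC content can be computed. The sketch below shows one way to do that. The helper names are hypothetical, and only the first line of the sequence is used as a demonstration, so its GC fraction need not equal the 70% reported for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "GTGGATCCGG TTTCGTCGTT CTTCGTGACT CGCTGGGCGA CCACGGCGAC GGACCGGGCG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC percentage of this one line only
```

Passing the full sequence block to `clean_sequence` in the same way would give the input needed to recompute the record's GC content field.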
 
Protein sequence
MDPVSSFFVT RWATTATDRA ARRFSEAIRG TEVQRALVTA VKNAIPKAAA RLHPDDDSLA 
GHVADCLWER DSHALPLVDG TALIDLADAV APWVDEVYTP VGETPDAGPE PSLLARLLRE
EIIAAVRAEA TAGDRVLHAL WSDYQNESIL RRVARPRATE PGRLDWGRLP LPGTFVGRRE
QLETLADMRS ESGLGLIVSV GGFGGIGKTA LASWFAASVR CSYPDGCVFV DFQSYPSPDQ
AVSSYAALGM LLEHLFGLEA ATVARMDLAA RGAEWQRRVA GKRMIFVWDN VSSIDQVEPL
LVRDPDCLTL ITSRERFHCA GSRTLDLDVL DHDAAVAMFV AVAGPRLAAD EAAVSRVVTA
CGHMPVLIGL KAADIACGTG SLDRIEVQLN NLPSAHTKAG LYARIDSSYA ALTAEQQAAY
RLLGKHPGRY LTVGTTTIVL SRLIGRSVTV AEAVALLDGL VAHRLAEPMV TDPPASIAEQ
LAYTAHDILL DHAAGLPAAT GEPEILSTVC DYYADRLAHY DFVGDLAWFR TERSALVAAL
TSPAARPHAT LNLAQLMLTN DNIGLAIDIL RELLDGFVAT GDSRGEVNTL QTLGQAVLVT
GDHEGARDLF TKAGHLAEEH GFRDCAAHAY LGLAQVKALN DSPVGIADMF ERALDIFVDL
DDVTNATNAL AGMAQVAMLQ QDFDAARSIF VEIEDTSEVR AYGMGVALGV LGQAHVAYLD
GDNDEAIRLY QRGEEASESV GYRFGVGMAR ECLGEIAAET GDVEGAIAHF RGALTVYVEI
GSPSQHEVRE ALRNLGVPAE PPD
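
The protein starts with M although the gene starts with GTG. Under translation table 11, GTG can act as an initiation codon, and an initiation codon is read as methionine. The sketch below checks the first and last codons of the gene against the ends of the protein. It uses a deliberately partial codon table that covers only the codons in this demonstration, and the set of start codons is a subset chosen for illustration; the names are hypothetical.

```python
# Partial codon table: only the codons needed for this demonstration.
CODONS = {"GAT": "D", "CCG": "P", "GTT": "V", "TCG": "S",
          "CCA": "P", "GAC": "D", "GTG": "V", "TGA": "*"}

# Subset of the initiation codons allowed by translation table 11 (assumed
# sufficient here); as initiators they are translated as M.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str, initiator: bool = True) -> str:
    """Translate a CDS fragment, stopping at the first stop codon.

    If initiator is True, a recognised start codon in first position gives M.
    """
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if initiator and index == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODONS[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


print(translate("GTGGATCCGGTTTCG"))                # MDPVS
print(translate("CCACCGGACTGA", initiator=False))  # PPD
```

The two outputs match the first five and last three residues of the protein sequence above; a full translation would need the complete codon table.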