Gene Snas_5507 details


Gene Information

Locus tag: Snas_5507
Symbol:
ID: 8886721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 5847233
End bp: 5848981
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 61%
IMG OID:
Product: extracellular solute-binding protein family 5
Protein accession: YP_003514231
Protein GI: 291302953
COG category:
COG ID:
TIGRFAM ID:
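
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 5847233
end_bp = 5848981
gene_length_bp = 1749
protein_length_aa = 582


def span_length(start, end):
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 5848981 - 5847233 + 1 = 1749 bp, and 1749 / 3 - 1 = 582 aa, which matches the listed gene and protein lengths.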


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.553842
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.243799
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAAGT CACTGGCGAC AGTCGGTGTA CTCGCCTTGA TGGTGAGCAC CGTCGCCGCC 
TGTAGCTCGA ATGAAGGCGG GAAGGACGAA GGCACCAAGG ACTTCGAGGT CAAGCCCGCT
GTCATCAGCA AGGATGCCAA GGATTCCGAG GGTCCGGCCA GCGAGGTCAA GGGTGCCCAG
ACAGGCGGGG AGGCCACCTA TCTGGCCCCC ACGACCTTCG ACCACCTTGA CCCTCGTCAG
ACCTACTACG TCAACACCCT TGAGATCGGC CGCCTGTTCT CCCGTCAGCT GACCAGCTAC
CGGGTGATGG GCGAGGAGAC CAAGGTCGTC GGCGACCTCG CCACCGGTCC CGGTAAGGAC
CTCGGCGACT GCAAGGCCTG GGAATACGAG CTCAAGGACG GCCTGAAGTA CGAGGACGGT
TCGCCGATCA AGGCCGACGA CATCGCCTAC GCGATCTCCT CGACCTTCGA CAGCCGACTG
CAGGACGGTC CCTCGTCCTA CTTCCGTGGC TGGCTGAAGG GTGCCGAGAA GTACAAGGGC
CCGTTCAAGG ACAAGGGTTC CCGCGCTCCG GGCATCAAGG TCGACGGTGA CAACAAGATC
ACCTTCGAGC TGAGCTCCCC GCACTGCGAC CTGCCGTACA TGGCGGCCAT GAGCGTCACT
TCTCCGCTGC CGGAGAAGAA GGAGGCCAAG AACCCGGCCG ACTACGACTT CAAGCCGTTC
TCCTCGGGCC CGTACAAGTT CGAGGGCAAG TGGAGCGAGA ACAAGGGCGT CACCCTCGTC
AAGAACGAGA ACTGGGACCC CAAGACCGAC CCGATCCGTC ACCAGTACGT CGACACCTTC
AAGGTGAACT TCGGTGACAA CCACAAGGCC ACCACTGACG CGCTGCGTGC CGACAAGGGC
GCGGACGCCA CGTCGATGAC CGACACGGTC GACATCAACC AGGTCCCTGA GATCGTCAAG
GACAAGGAAC TGATGAAGCG GGTCGAGAAC GTCCCGGGCA TCTTCGTGTA CTGGATGGGC
ATCAACAACA TGAAGATCAA GGACCCCGAC GTCCGCAAGG CGCTGGCGTA CGCGGTCGAC
AAGGAGGCCA TCGTCAAGGC CACGGGTGGA TCGAGCCAGG CCACGCCCGC CTCGACGACC
CTGAGCCCGA CCGTCGCCGG TTACGAAGAC CAGATGGACA TGTACAAGGG TCCGAAGGGC
GACAAGAAGA AGGCCAAGGA GCTGCTCAAG GGCAAGGACG TCAAGTCGCT GACGTACGCC
TACCGTGCCA GCCCCGCCAA CAAGAAGATC GCCTCCTCGC TGCAGGACCA GCTCAAAGAG
GTCGGCATCG AGCTGAAGAT CAAGGAGCTG AGCGAGACCG AGGCTCCGTC GATCCTGAGC
GACCCGCAGG AGAACAAGTA CGACCTGTAC ATGAAGAACT GGGGTGCTGA CTGGCCCACC
GGTTACAGCG TGCTGCAGCC GATCTACGAC GGCCGCACCA TCACTGACGA CCCGGGCAAC
GTCAACAACA TCTGGTTCGA CGAAAAAGAG GTCAACGACC AGATCGACAA GGTCATGAAC
ATGACCGACC CCGAGGAGCA GAACAAGGCC TACATGGATC TCGACAAGAA GATCCTGGAG
GAGTACATGC CGATGGTCCC GCTGTACTAC AGCAAGACCT TCGCGATGCA CGGTTCCAAG
GTGGGCGGTC TCTACTCGAC CAACACCACT GGTACCACCT CGTTCACCGA CGTCTTCGTC
AAGTCGTAA
 
Protein sequence
MRKSLATVGV LALMVSTVAA CSSNEGGKDE GTKDFEVKPA VISKDAKDSE GPASEVKGAQ 
TGGEATYLAP TTFDHLDPRQ TYYVNTLEIG RLFSRQLTSY RVMGEETKVV GDLATGPGKD
LGDCKAWEYE LKDGLKYEDG SPIKADDIAY AISSTFDSRL QDGPSSYFRG WLKGAEKYKG
PFKDKGSRAP GIKVDGDNKI TFELSSPHCD LPYMAAMSVT SPLPEKKEAK NPADYDFKPF
SSGPYKFEGK WSENKGVTLV KNENWDPKTD PIRHQYVDTF KVNFGDNHKA TTDALRADKG
ADATSMTDTV DINQVPEIVK DKELMKRVEN VPGIFVYWMG INNMKIKDPD VRKALAYAVD
KEAIVKATGG SSQATPASTT LSPTVAGYED QMDMYKGPKG DKKKAKELLK GKDVKSLTYA
YRASPANKKI ASSLQDQLKE VGIELKIKEL SETEAPSILS DPQENKYDLY MKNWGADWPT
GYSVLQPIYD GRTITDDPGN VNNIWFDEKE VNDQIDKVMN MTDPEEQNKA YMDLDKKILE
EYMPMVPLYY SKTFAMHGSK VGGLYSTNTT GTTSFTDVFV KS
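
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method the database used: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard one, which translation table 11 shares for internal codons (table 11 differs in its set of permitted start codons, which does not matter here because the gene starts with ATG).

```python
# Standard codon assignments, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = "ATGAGAAAGT CACTGGCGAC AGTCGGTGTA"
print(translate(first_bases))  # prints MRKSLATVGV
```

The printed peptide matches the first ten residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the listed protein sequence and GC content.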