Gene Snas_2013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_2013 
Symbol 
ID8883205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp2136151 
End bp2137422 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content65% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003510801 
Protein GI291299523 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.991542 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0817819 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCGA AGATCGCCAC CCTGGCGGCA CTGGTCATGT TGGCGGCCAC CGGATGCGGA 
CTCTCCGGCG AGGACTCCGG CGGCGACGTG GACGTGTCCG GCGAGGTCAC CGGCAAGGTC
TCCCTCCAGA CCTGGGCTCT GAAACCCAAG CACACGAAGT ACGTCGAGAA ACTCATCGAC
GGTTTCGAGG ACAAGTACCC CGGCACCAAG GTCAAGTGGC TGGACCAGCC GGGCGACGGC
TACTCCGAGA AGGTCCTCAA CCAGGCCGCC AACGACGAAC TGCCCGATGT GGTCAACCTG
CCGCCGGAGT TCGCGCTGCC GTTGGCGGAC AAGGAACTGC TGCTCGACGT CGCCGATACC
GACGACAAGC TGAAGAAGGA CTACCTGTCC GGCGCCATCG ACGCGTATCG CTTCCCCGGT
GTCAAGGGCG CCTACGGCTA CCCGTGGTAC CTCAACACCG ACGTCAACTA CTGGAACGCC
GACCTGATGA AGAAGTACGG CCTCGATCCC GACAAGCCGC CGACGTCGTT CGAGGACCTG
GTGGCACAGG CAAAGACCAT GAAGAAGAAG TCCGACGGCA AGATGTTCCT GATGAGCCGC
AAGCCATCCT GGGAGGACCT CACCAACGCG GGCGTCGAGA TCCTGTCGTC CGACGGCGAG
AAGTTCACCT TCAACACCGA CGCGGCCGCC GAACTCCTCG ACGGCTACCG CGACGCCTAC
GCCGACGGCC TGCTACCCGA CGACGTCCTC ACCGACGCCT ACCTCGGCAA CAGCGAACTG
TTCAAACAGA AGAAGGTCGC CTGGTCCACC GGCGGCGGCA ACTTCATCAA CGACGTCAAG
GTCGACAACC CGAAGCTGGC CAAGGACATC GTCCCGTCCA AGGCCCTGGA CACCCCGCCG
CTGTACGTGC AGGGGCTCTC GGTGTCCAAG AAGAGCGACA ACCTGCCCAC CGCGGTCGCC
CTGGCCCGCT GGGTGACCAA CGCCGACAAC CAGGCCGACT TCGCCGAACG GGTGCCCGGG
ATCTTCCCGT CCACCACGGC TTCGGCCGAC GACGAGTCGT TCTCCGAAAG CGACGGCAGC
GGCGCCGGTG ACGCCAAGAA GCTCGCTTTC GAGTCGCTGG CCGAGGCCGA ACTGCTCAAG
CCGGTCGTCG TCGACGACGC CATCAACGAC GTGTTCAACC AGCAGATCTC GCTGGCCGTC
AGCGGCGAGA CCAGCTCGAA GCAGGCCCTG GACAAGGCCG CCGAAGAGTG CACCAAGCTG
CTGAACGACT GA
 
Protein sequence
MRAKIATLAA LVMLAATGCG LSGEDSGGDV DVSGEVTGKV SLQTWALKPK HTKYVEKLID 
GFEDKYPGTK VKWLDQPGDG YSEKVLNQAA NDELPDVVNL PPEFALPLAD KELLLDVADT
DDKLKKDYLS GAIDAYRFPG VKGAYGYPWY LNTDVNYWNA DLMKKYGLDP DKPPTSFEDL
VAQAKTMKKK SDGKMFLMSR KPSWEDLTNA GVEILSSDGE KFTFNTDAAA ELLDGYRDAY
ADGLLPDDVL TDAYLGNSEL FKQKKVAWST GGGNFINDVK VDNPKLAKDI VPSKALDTPP
LYVQGLSVSK KSDNLPTAVA LARWVTNADN QADFAERVPG IFPSTTASAD DESFSESDGS
GAGDAKKLAF ESLAEAELLK PVVVDDAIND VFNQQISLAV SGETSSKQAL DKAAEECTKL
LND