Gene Snas_3251 details


Gene Information

Locus tag: Snas_3251
Symbol:
ID: 8884450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 3430425
End bp: 3431711
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 65%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003512014
Protein GI: 291300736
COG category:
COG ID:
TIGRFAM ID:
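
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for a complete CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3430425, 3431711)
    print(length, "bp")
    print(expected_protein_length(length), "aa")
```

With the values in this record, the two functions give 1287 bp and 428 aa, which agrees with the Gene Length and Protein Length fields.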


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.292164
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0559218
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAACGTG GAACCAGCCG CCGTACCCTC CTGTCCCTCG GCGTGGCGGT GCCGATCCTG 
GCCGCGACCG GATGCGGACG GCAACCCGGC GCCGCCGGCG CCACCGCCTG GGCCCTGACC
GGCGGATCCG AGGACGCCAC CCGCGACTCC TTCAAAGCCT GGAACAAGAG CCACCCCGAC
GAGAAGATCA CCGTCGAATG GTTCGCCAAC GATGACTACA AGGAGAAGAT CCGCACCGCC
GTCGGCTCCG GTAACGCCCC AACACTCATC TTCGGCTGGG CCGGGGCCCT GCTGGCCGAC
TACGTCAAGA ACAAGCACGT CATCGACCTG ACCGAGGACG TCGAGAAGCT CAGTAAACGG
CTGCTGCCCT CGGTGGCGGC CGTCGGGCAG ATCGACGGCA AGACCTACGC CGTTCCCAAC
AACCAGACCC AGCCGATCGA GATGTTCTAC AACACCGAGG TTCTGGGCAA GGCCAAAGCC
GAGCTCCCGA AAACCTGGGA CGAACTGCTC GATGCGGTCG ACAAGCTCAA ATCCGCCGAT
GTCATCCCGA TCGCGCTGGC CGGGCAGAGC GTGTGGCCCG AACTGATGTG GATCGAATAC
CTCGCCGACC GGATCGGCGG GCCGGAGGCA TTCGGCCGCG TCCTCGACGG CGAGAAGGGC
GCCTGGTCGC ACCCGGACAT GCTGGACGCC CTTGACAAGG TCACCGAACT GGTGGAGCTG
GGCGCCTTCG GCGACAACTA CGGTTCGGTG GAGGCCGACG CGGGTGCCGA CACCGCGCTG
GTCTACACCG GCAAGGCCGG GATGATCCTG CACGGCAGCT GGGTCTACCC CGACTTCATC
AAGAACGCCC CCGAACTGAT CAAGAAGGAC GGTCTGGCCT ACTCGACCTT CCCGACCGTC
AAGGGCGGCA AAGGCGATAA ATCAAATGTG GTCGGCAATC CGGCGAACTT CTGGTCGGTG
TCGTCCTCGG CCACCGAAAC CCACCGAAAC ACCGGCACCT CATACCTGGG CGGGGACCTG
TTCGACGACG ACCACATCGA CTCGCTGCTG GCCGTCGGGG CCGTACCCCC GGTCACCGGC
ATCGAGGACA AGATCGGCGA GGCCGACAAC GCCGACTACC TGGACTTCAC CTACGGCCTG
GCTCGCGACG CCAAGCACTT CGAACTGTCC TGGGACCAGG CGCTACCGTC CAAACAGGCC
CAGGAACTGC TGGAAAACCT GAGCAAACTG TTCCTCGGCG ACGTTGGTGC CGAGGACTTC
GCCAACGCGA TGGACAAGAC CCTGTGA
 
Protein sequence
MKRGTSRRTL LSLGVAVPIL AATGCGRQPG AAGATAWALT GGSEDATRDS FKAWNKSHPD 
EKITVEWFAN DDYKEKIRTA VGSGNAPTLI FGWAGALLAD YVKNKHVIDL TEDVEKLSKR
LLPSVAAVGQ IDGKTYAVPN NQTQPIEMFY NTEVLGKAKA ELPKTWDELL DAVDKLKSAD
VIPIALAGQS VWPELMWIEY LADRIGGPEA FGRVLDGEKG AWSHPDMLDA LDKVTELVEL
GAFGDNYGSV EADAGADTAL VYTGKAGMIL HGSWVYPDFI KNAPELIKKD GLAYSTFPTV
KGGKGDKSNV VGNPANFWSV SSSATETHRN TGTSYLGGDL FDDDHIDSLL AVGAVPPVTG
IEDKIGEADN ADYLDFTYGL ARDAKHFELS WDQALPSKQA QELLENLSKL FLGDVGAEDF
ANAMDKTL
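
The sequences above are laid out in blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch, assuming plain Python with no external libraries, of how the gene sequence could be joined, checked for GC content, and translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares for internal codons; alternative start codons are not handled here.

```python
from itertools import product

# Standard amino acid assignments in TCAG codon order. Translation table 11
# uses the same assignments; it differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    first_blocks = "ATGAAACGTG GAACCAGCCG CCGTACCCTC"
    print(translate(first_blocks))
```

Translating the first three blocks of the gene sequence gives MKRGTSRRTL, which matches the first block of the protein sequence. Running `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content field of this record.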