Gene Snas_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_0203 
Symbol 
ID8881381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp217238 
End bp218563 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content67% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003509015 
Protein GI291297737 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.346278 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.123927 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCTTC CCGACTCCGG CCCCTTTTCC CGCCGCGCCC TGCTCGGCCT CGCCGCCGGA 
TCCACCGCCG CCATCGCCCT GTCGGCCTGC GGCGGCGGCT CCGACACCGC CGAGGACGGC
AGCCAGGGCG GCACCAAGTA CGACGGCCCC AAAGTGGACC TCGACTTCTG GAACGGCTTC
ACCGGTGGCG ACGGCCCCAT CATGAAGCAG CTGGTCAAGG ACTTCAACGC CGAGCACGAC
AACATCAAGG TCAAGATGAC CACCTACGAG TGGGAGTCGT ACTACGAGAA GGTGCCCGCC
GCCGTGCGCA GCGGCAAGGC GCCCGACATC GGCATCATGC ACGTCGACAG CCTGGCCACC
AACGCCGCCC GCGGCGTGAT CCTGCCGCTC GACGACGTCG CCGACGCCCT GAAACTGTCC
AAAGGTGACT TCGTCGAGCC GGTGTGGAAC GCCGGTGTCT ACGACAAGAA GCGCTACGGC
ATCCCGCTGG ACGTCCACCC CGAGGGCAAC TTCTACAACA AGAAGCTGCT CGACGAGGCC
GGACTCGACC CGGACAATCC GCCCGCCACC GGCGACGACT ACGCCGACGC CCTCGACAAG
CTCAAGAAGG CCAAGATCAA GGGCATGTGG ATGACGCCGT TCCCGTTCAC CGGCTCCCAC
ACCTTCCAGT CGCTGCTGTG GCAGTTCGGC GGCGACCTGT TCAGCTCCGA CGCCAAGGAC
CCCGCCTTCG CCGAGGACGC GGGCGTCAAG GCGCTGACCT GGATGGTCGA CCTGGTCAAG
GACGGCCACA GCCCCAAGGA CGTCGGCCAG GACGCCGACG CGGTGGCGTT CCAGAACGGC
AAGACCGCTT TCAACTGGAA CGGCATCTGG AGCATCAACA CCTTCAACGA CGTCGACGGC
CTCGAATGGG GCGTGGCGCC GCTGCCGCAG ATCGGTGAGC AAAAGGCCGC CTGGGCCGGT
TCCCACAACT TCGTGCTGCT CAAGCAGCGC ACCGTCGACA CCAACAAGCA GGCCGCGTCC
AAGGTGTTCG TCAACTGGAT CAGCGGCAAG TCGGTGGAAT GGGCCAAGGG CGGGCAGGTC
CCGGCCCGCA ACAGCGTCCG CGATTCCAAG GAGTTCGGCA AGCTCACCGA GCAGTCGGTG
TTCGCCGAGC AGGTCGACTA CCTGCACTTC CCGCCCGCCG TGCCGGGCAT CGGCGACGCG
ATGCCGCAGG TCGACAAGGC CGTCAACCAG GCGGTGCTGC TGAAGAAGAA GCCCGCCGAC
GCGCTGGCCG ACGCGGCCGA CAAGGCGGCC AAGATCCTGG CCGAGAACCG GAAGAAGTAC
GGCTGA
 
Protein sequence
MPLPDSGPFS RRALLGLAAG STAAIALSAC GGGSDTAEDG SQGGTKYDGP KVDLDFWNGF 
TGGDGPIMKQ LVKDFNAEHD NIKVKMTTYE WESYYEKVPA AVRSGKAPDI GIMHVDSLAT
NAARGVILPL DDVADALKLS KGDFVEPVWN AGVYDKKRYG IPLDVHPEGN FYNKKLLDEA
GLDPDNPPAT GDDYADALDK LKKAKIKGMW MTPFPFTGSH TFQSLLWQFG GDLFSSDAKD
PAFAEDAGVK ALTWMVDLVK DGHSPKDVGQ DADAVAFQNG KTAFNWNGIW SINTFNDVDG
LEWGVAPLPQ IGEQKAAWAG SHNFVLLKQR TVDTNKQAAS KVFVNWISGK SVEWAKGGQV
PARNSVRDSK EFGKLTEQSV FAEQVDYLHF PPAVPGIGDA MPQVDKAVNQ AVLLKKKPAD
ALADAADKAA KILAENRKKY G