Gene Snas_4078 details

Gene Information

Locus tag: Snas_4078
Symbol:
ID: 8885279
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4356898
End bp: 4358322
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 63%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003512823
Protein GI: 291301545
COG category:
COG ID:
TIGRFAM ID:
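
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it reproduces the reported gene length) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the record.

```python
# Values copied from the record above.
START_BP = 4356898
END_BP = 4358322


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1425 474
```

Both results agree with the Gene Length and Protein Length fields.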


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.305211
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGACA ACCAACTCAC CGGTCAGGGC CGCAACAGGC GTGATGTGCT GCGGCTCGCG 
GGCCTCGGGG CGGCCCTCAC CGTCAGCGCG CCGATGCTGC AAGCCTGCGG TTTCGGCGGC
GGCACCCAAA GCGGTGACTC GGCGGCCAAC GACATCACCG GCAGCTTCGA CTGGAAGAAG
TACGACGGCA AGACCGTCAA GCTGCTGATG AACAAACACC CCTACACCGA CGCGCTCAAG
AAACACAAGG GCGAGTTCGA AAAGAAGACC GGCATCACGC TGGAGATCGA CGAGTTCCCC
GAGTCGAACT ACTTCGACAA GGTCACCCTG GAGCTCCAGT CCAAACAGGG CAACTACGAC
GCCTTCATGC TCGGCGCCTA CATGGTGTGG CAGTACGGCC CGCCCGGATA CCTGGAAGAC
CTCGGACCGT GGATGAAGAA CGAGTCGGCC ACCCACGAGG AGTTCGAGAA GGAGGACTTC
TTCCCGAACC TGCTCAAGGC CGGACAGTGG AACTTCACCA ACGGCGACCC GCTCGGCGGC
AAGCAGCAGT GGATGCTGCC GTGGGGCTTC GAGACCAACG TGGTGTGTTA CCGCGAGGAC
GTCTTCAAGA AACTGAAGCT CAAACCGGCC GAGGACTTCG ACGAGTTCAT CGACCTGGCC
AAGACGCTGG ACAAGAAGGC GGGCGACGGC ATGTACGGCG TCGCGGTGCG CGGTTCCAAG
GAATGGGCCA CCATCCACCC CGGCTTCATG ACCATGTACT CCCGGCTCGG TCTCAAGGAC
TTCGACGTCA AGGACGGCAA ACTCGTCCCC ACGATGAACT CCGCCGACGC GGTGGACTTC
ACCGACAAGT GGGCCAAGAT GGTGCGCGAC TCCGGCCCCA AGGGCTGGAC CTCCTACACC
TGGTACCAGG CCTCCAGCGA CCTGGGCGCG GGCAAGGCCG CGATGCTGTT CGACGCCGAC
TCCGCCTCGT ACTTCCAGAA CGAGGGCACC AAGGCCGCCG GAAAGCTCGC CTGGCACCCC
GGACCCAAGG GGCCCAACGG TTCCCTGGAC ACCAACATGT GGATCTGGTC GCTGGCCATG
AACGCCAACT CCAAGAACAA GGAAGCCGCG TGGTGGTTCC TGCAGTGGGC CACGTCCAAG
GAGCACCTGA AGTACGCCGC CACCAAGGGC CAGCACATCG ACACGGTTCG CCAGTCGATC
GCCGAATCGT CGGAGTACAA GGACAAGTTC GCCGACTACA CCGGCTTCCT CGACACCTTC
GAAAAGGTCA TCGACGACAC CAAGATCCAG TTCACGCCGC AGAAGAACTT CTTCGACGCC
ACCACCTCGT GGGCCGAGGC GCTGCAGACG ATCTACGGCG GCGAGGACGC CAAGTCCACA
CTCGACGACC TCGCGTCCAA CCTTGAGTCG AGGGTCAACG ACTGA
 
Protein sequence
MNDNQLTGQG RNRRDVLRLA GLGAALTVSA PMLQACGFGG GTQSGDSAAN DITGSFDWKK 
YDGKTVKLLM NKHPYTDALK KHKGEFEKKT GITLEIDEFP ESNYFDKVTL ELQSKQGNYD
AFMLGAYMVW QYGPPGYLED LGPWMKNESA THEEFEKEDF FPNLLKAGQW NFTNGDPLGG
KQQWMLPWGF ETNVVCYRED VFKKLKLKPA EDFDEFIDLA KTLDKKAGDG MYGVAVRGSK
EWATIHPGFM TMYSRLGLKD FDVKDGKLVP TMNSADAVDF TDKWAKMVRD SGPKGWTSYT
WYQASSDLGA GKAAMLFDAD SASYFQNEGT KAAGKLAWHP GPKGPNGSLD TNMWIWSLAM
NANSKNKEAA WWFLQWATSK EHLKYAATKG QHIDTVRQSI AESSEYKDKF ADYTGFLDTF
EKVIDDTKIQ FTPQKNFFDA TTSWAEALQT IYGGEDAKST LDDLASNLES RVND
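
The gene and protein sequences above can be compared by translating the nucleotide sequence. The sketch below is one way to do that in plain Python, under stated assumptions: it uses the standard codon-to-amino-acid assignments (which translation table 11 shares with the standard code for elongation), it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that is not needed here), and the helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAACGACA ACCAACTCAC CGGTCAGGGC"))  # prints: MNDNQLTGQG
```

Passing the full gene sequence to `translate` should yield a string that can be compared with the protein sequence after its spaces are removed, and `gc_content` on the same input gives a fraction to compare with the GC content field.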