Gene Snas_5891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5891 
Symbol 
ID8887107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp6249704 
End bp6250912 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content65% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003514612 
Protein GI291303334 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.403986 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAGA TCCCCAGCGT GCTCCTGGCG GCCGCGATCG CGGCCGTGAG TCTGAGCGCC 
TGTAGTGCCG AAGCCGACAA GGCCAAACTC GTGATCTGGG CCGACGAGGT CCGCGGCGAG
GTGCTCAAGC CCTACGCCAA GCAGTTCGGT GAGGACAACG GCATCGAAGT CTCGGTGGAG
GTCCACGCCG AGGAACTGCA GGAGGACTTC ATCACCGCGT CCGAACAGGG CAAGGGCCCC
GACATCCTGG TCGCCGCCCA CGACTGGATC GGCAACCTGG TGCAGAACAA GGCGATCGAC
CCGGTGCAGC TGAGCTCCGA GCAGAAGAAG GCCCTGGACC CGACGGCCCT GGAAGCCGTC
ACCTACGACG GCGACGTCTA CGGCAACCCG TACGCCGTGG AGAACCTGGC CCTGATCCGC
AACACCAAGC TGGCGCCCGA ACAGCCGAAG ACCATTGAGG ACCTGACTGC CACCGGCAAG
AAACTCAAGA AGGACGGCAA GACCGACGAC ATCCTGTCCA TCGAGGTCAG TGACGTCGGC
AACCCGTACC ACGCCTACCC GTTCTACGCC TCCGCCGGTG GTTACCTGTT CGGCCAGGAC
GACAAGGGCC AGTACGACCC GGGGGACCTG GGACTGTCCA AACCCGAGGC CGTCGACGCC
TTCGCCAAAC TCGCGAAGCT CGGCGAGGAC GGGGTCATGA AGACCTCGAT GAGCGCCGAC
AACTCGATTC CCAAGTTCGT CGAGGGCAAG ACCCCGTACC TGGTGTCGGG ACCGTGGGCG
CTGTCCCAGG TGAAGAAAGC CAAACTGGAC TACGAGGTCA CCGAGATCCC CGGCTTCAAG
GGCGGGAAGA AGGCCACCCC GTTCGTCGGC GTGCAGGCGT TCTACGTGGC CAGCAAGGGC
GACAACAAGA CCCACGCCCA GGAGTTCGTC ACCAACTACG CCGCCACCGA GAAACTCAAC
AAGGACCTGT TCGAGGCCGA TCCCCGGATG CCCGCGCTGA CCGACGTCCG CGCCGAGGTC
AGCGCCGACA ACAAGGACAT CGCCGGTTTC GAGAAGGCCG GTGAGGGCGG CACGATCCTG
CCCGCCATTC CCGCGATGTC CGCCGTGTGG GACCCGTTCG GCAAGGCCCA GGCGGCGGTG
CTGAAGGGCG AGGACCCCGA AAAGGCCGTC AAGTCCGCGG CCAAGACCAT CACCGAGGCC
ATCGGATAA
 
Protein sequence
MRKIPSVLLA AAIAAVSLSA CSAEADKAKL VIWADEVRGE VLKPYAKQFG EDNGIEVSVE 
VHAEELQEDF ITASEQGKGP DILVAAHDWI GNLVQNKAID PVQLSSEQKK ALDPTALEAV
TYDGDVYGNP YAVENLALIR NTKLAPEQPK TIEDLTATGK KLKKDGKTDD ILSIEVSDVG
NPYHAYPFYA SAGGYLFGQD DKGQYDPGDL GLSKPEAVDA FAKLAKLGED GVMKTSMSAD
NSIPKFVEGK TPYLVSGPWA LSQVKKAKLD YEVTEIPGFK GGKKATPFVG VQAFYVASKG
DNKTHAQEFV TNYAATEKLN KDLFEADPRM PALTDVRAEV SADNKDIAGF EKAGEGGTIL
PAIPAMSAVW DPFGKAQAAV LKGEDPEKAV KSAAKTITEA IG