Gene Snas_2982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_2982 
Symbol 
ID8884181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp3150866 
End bp3152587 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content65% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003511749 
Protein GI291300471 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.270832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0254721 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGGAA TAGCGGGATA CGCGCGCAGG CATATAGCGC TTGCCATGGT GGCGGCGCTG 
ACGGTCACGA TGGCCGGAGC CTGCACCGCC GACACCCCCA AACCCGAGAC GGTCGCCTCG
AAACCCGGCG AGGGCGGCAC GGTGCGCGTC TTCACCCAGA CCCTGGAGAC CCTCGACCCG
CAGCGCGTGT ACGTCATCAC CGGCCTGAAC GTGTCCACAC TCATCACCCG GACACTGACG
ACGTTCACCG CCAAACCGGG GGAGAAGCCC AAGCTCGTCG GCGACCTGGC CACCGACACC
GGCAAACCCA ACAAGGACAA CACCTCCTGG ACCTACACGC TGCGCGACGG CGTCAAATGG
CAGGACGGCA CCGAGCTCAG CTGCGAGGAC GTGCGCTACG GCGTGCTGCG CAACTTCGAC
GTGCGACGCA AGGACGCCAA GATCACCGGC GGCCCGAGCT ATCCCACCGA ATGGCTTGAC
GTCCCCGAGG ACTACGAAGG ACCCAAGGGC GACAAGTCCG ACAAGGACCT CTCCGGGGTC
ACCTGCGAGA ACAAGCGCAC CATCCGCTTC GACCTGAAGG AGCCGCAGGC CAACTTCCCG
TCCGCGGTGA ACCTGCCCGC GTTCTCCCCG GTCCCGGAGC AGCACGACAC CTGGGCCGAC
TACGGCGAGG AACCGGTGTC GACCGGACCG TACAAACTGT CCTCGTACAA GCCCTCCAAG
GGCGACACGC CGGGACGCGC GGTCTTCGAA CGCAACCGGT TCTGGGACTC CGAGACCGAC
AAGATCCGCG ATGCCAAGCC GGACAAGATC ATCCTCGAAC TCGGCAAGGA CCCCGAGGAG
GTCGCGCAGC AGATCGTGTC CGACAACCCC GGCTACGACA ACGCCGTCCT GTACGACTCG
GTGCCGAACA AGTTCGTCTC GCAGGTCGTC AACGACAAGC AGCTGAAGAA GCAGACGGTG
TCGGGCTCCA CCAGCGGCGT CACCTACATG GCGATCAACA CCGAGACCGT GGAGGGCCGC
GACTGTCGGC GGGCGTTGAT GTACGCGTTC AACAAGAGCA AGTACATGGA CGCCATCGGC
GGCGACGTGT TCGGCGACTA CGCCACCACG ATGCTGCCGC CGTCGGATCC GGCGCACCGC
GACCACGACG TCTACGGCCT GGACGGCGAC CCTGACGGCG ATCTCGACAA GGCCCGGGAA
CTGCTCGAGG AGGCCAAGGG CTGTCCCGAA AGCCTCACCC TCGACGTGCA GGACACCGAA
CGCGGTGAAC GCACCGGCGA CACGATTGCG GAGACCTTCG GGCGCCTGGG CATCACCGTC
AAGGTCAACA AGATCGCCCC GAACAAGTAC TTCGACACGC TGTCCCAACC GGACAAGCTG
CACGACCTGA CGATCGCCTC CTGGATCCCG GACTGGCCGG GCGGCTCGGG CGTGATCCCG
GCGCTGTTCG ACGGCGACCT CATCAAGCCC GGCCTCAACA GCAACTACTC CAAACTGGAT
GATCCCAAGA TCAACGAGCG GATCGACGAG GCCAGTATGG AGACCGACCG CAAGCGGTCG
TACGAACTGT GGGCCGACCT GGACGAGCAG ATCCAGAAGG AGGCCGCGGT CGTCCCGATC
GTGTACCCCA AGGCGCTGAA CCTGTGTGGC GTCGACGTAC GCGGCGGCGT GCTGAACCCG
CAGTGGGGCG GCATCGACTT CGCCTCGCTG GGTGTCAAAT GA
 
Protein sequence
MRGIAGYARR HIALAMVAAL TVTMAGACTA DTPKPETVAS KPGEGGTVRV FTQTLETLDP 
QRVYVITGLN VSTLITRTLT TFTAKPGEKP KLVGDLATDT GKPNKDNTSW TYTLRDGVKW
QDGTELSCED VRYGVLRNFD VRRKDAKITG GPSYPTEWLD VPEDYEGPKG DKSDKDLSGV
TCENKRTIRF DLKEPQANFP SAVNLPAFSP VPEQHDTWAD YGEEPVSTGP YKLSSYKPSK
GDTPGRAVFE RNRFWDSETD KIRDAKPDKI ILELGKDPEE VAQQIVSDNP GYDNAVLYDS
VPNKFVSQVV NDKQLKKQTV SGSTSGVTYM AINTETVEGR DCRRALMYAF NKSKYMDAIG
GDVFGDYATT MLPPSDPAHR DHDVYGLDGD PDGDLDKARE LLEEAKGCPE SLTLDVQDTE
RGERTGDTIA ETFGRLGITV KVNKIAPNKY FDTLSQPDKL HDLTIASWIP DWPGGSGVIP
ALFDGDLIKP GLNSNYSKLD DPKINERIDE ASMETDRKRS YELWADLDEQ IQKEAAVVPI
VYPKALNLCG VDVRGGVLNP QWGGIDFASL GVK