Gene Snas_6149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_6149 
Symbol 
ID8887371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp6507528 
End bp6509228 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content63% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003514865 
Protein GI291303587 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.784749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAACA CCAGATCCAA ACCCGTACGC CTGCTCGCCG TACTGACGGC CGCTGGCGTC 
GCCGCCGCGC TATCGGCCTG TGCCGCCGAT CCCGGCGCCG CCAAGGACAA CGACAAGACC
GCGTTCAAGT TCGCCACCGC CAGCGAACCG ACCTCTTTGG ACCCGGCGCT CGCCTCCGAC
GGTGAGACCT TCCGGGTCAA CCGCCAGGTC GTGGAGACCC TGGTGGAGCA CAAGACCGGT
GGCGACGAGC TGGTTCCCGG GCTCGCCAAG GAGTACTCGC CCAGCAAGGA CGGCCGGACC
TGGAACTTCA CCCTCAACGA GGGCGTCAAG TTCCACGACG GCGACGACCT GACCGCCGAA
GCCGTGTGCG CCAACTTCGA CCGTTGGTAC AACTGGAAGG GCGTCTACCA GAACCCGGCG
CTGTCGGGCT ACTGGCAGGA CATCATGGGC GGCTTCGCCA AGAACGAGAA CAAGGACCTG
CCCAAGTCCA ACTACGACGG CTGTAAGACC GACGGCGAGT TCAAGCTGTC CATCAAGGTC
AAGGAACCCA CCGCGAAACT GCCGGGCGGC TTCTCACTGT CGTCGCTGGG CATCCTGAGC
CCGAAGACGC TGGCCGCGGC CGACAAGCAG CAGCCCAAGC AAGAGGGCGA GTCCATCAAG
TACCCCGACT ACAGCCAGGA GGTCGGCACC ATCGCCGGCA CCGGGCCGTA CGAGTACTCG
AAGTGGGACA AGAGCCAGCA GGAAGTCACC ATCAAGGCCA ACAAGGACTA CTGGGGCAAG
ACCAACGCCA AGATCAAGAC CATCATCTTC AAGGCCATCT CCGAGGAGAA CGACCGCAAG
TCGGCCCTGA TCTCCGGCGA CGTCGACGGT TACGACCTGG TGGCGCCGCA GGACATCGAC
GACCTGAAGA AGAAGGACAT GAACGTCCTC ACCCGGGATC CGTTCAACAT CTTCTACATC
GGTCTGAACC AGAAGCTCGT CGACCTGAAG ACGAACAAGG GCAAGAAGAC CGTCTTCGCC
GACAAGGACG TCCGCAAGGC CATCGCGCAC TCCATCAACA AGGACAAGAT CATCAAGCAG
ATCTATCCGA AGGGGACCGA GGCCGCCACC CAGTTCCAGC CGCCGTCGCT GGACGGCTGG
TCGGACAACG TGCCGAAGTA CGAGTACGAC AAGGACAAGG CCAAGGCGCT GCTCAAGAAG
GCCGGGCAGT CGGACATGAA GATCGACTTC TGCTACCCGA CCAAGACCAC GCGGCCCTAC
ATGCCGGACC CGAAGTCCAT CTTCGACAAC ATGAAGTCGG ACCTGGAGGC CGTCGGCATC
ACCGTCGAGG AGAAGCCGCT GCAGTGGTCG CCGACCTACG GTGACCAGAC CTCCGCCGGT
GGTTGCAGCA TGTACATCCT GGGCTGGACC GCCGACTACG CCGAGGCGTT CAACTTCAAC
GGCACCTGGT TCTCGCAGTA CACCCCGGCC TGGGGCTTCA AGGACGACAA GGTCTTCGAC
GCGCTGGCCA AGGCCAACGC CGAAGCCGAC CCCGCCGAAC GCGCCAAGCT GCACCAGAAG
GCGAACGAGG CCATCATGGA CTACGTCCCC GGTGTCCCGA TCTCGCATTC CTCGCCGTCC
ATCGCCTTCG CCGACTACGT GAAGGCTCCG ACCCTGTCCC CGCTGACTCA GGAAAACTTC
GCTGAGACCA GCTTCAAGTA A
 
Protein sequence
MRNTRSKPVR LLAVLTAAGV AAALSACAAD PGAAKDNDKT AFKFATASEP TSLDPALASD 
GETFRVNRQV VETLVEHKTG GDELVPGLAK EYSPSKDGRT WNFTLNEGVK FHDGDDLTAE
AVCANFDRWY NWKGVYQNPA LSGYWQDIMG GFAKNENKDL PKSNYDGCKT DGEFKLSIKV
KEPTAKLPGG FSLSSLGILS PKTLAAADKQ QPKQEGESIK YPDYSQEVGT IAGTGPYEYS
KWDKSQQEVT IKANKDYWGK TNAKIKTIIF KAISEENDRK SALISGDVDG YDLVAPQDID
DLKKKDMNVL TRDPFNIFYI GLNQKLVDLK TNKGKKTVFA DKDVRKAIAH SINKDKIIKQ
IYPKGTEAAT QFQPPSLDGW SDNVPKYEYD KDKAKALLKK AGQSDMKIDF CYPTKTTRPY
MPDPKSIFDN MKSDLEAVGI TVEEKPLQWS PTYGDQTSAG GCSMYILGWT ADYAEAFNFN
GTWFSQYTPA WGFKDDKVFD ALAKANAEAD PAERAKLHQK ANEAIMDYVP GVPISHSSPS
IAFADYVKAP TLSPLTQENF AETSFK