Gene Snas_4144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4144 
Symbol 
ID8885345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4434185 
End bp4435768 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content62% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003512888 
Protein GI291301610 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.482132 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00782841 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCGCAA AACCCCTTAT AGCGCTCGCA GCTGCCGCGG TGCTTACCGC GTCCCTCGCC 
GCCTGTGGCG GCGGTGGCGG CGAAAGCGCC AGCGGCGACA TGAAGTTCGG ACTCGGCGCT
GAGCCGCCGA GCCTGATTCC GCCGAACACC ACCGAGTCCG AGGGTTCGAT CATCCTCAGC
CTGACGTACG CGGGGCTGTT CGACTACAAC GAGAAGGCCG AAGCCGAGAA CCTGCTGGCC
AAGTCGATCG AAACCAAGGA CAACAAGACC TGGACGATCG AGATCGAGGA CGGCCACAAG
TTCCACAACG GCGAGCCCGT TGACGCGGAA GCGTTCAAGC GCGCCTGGGA CTGGGGCGCC
TACGCCCCCA ACGCCGCCAC CGGCAGCTAC TTCTTCGACC GCATCAAGGG CTTCGCCGAC
ATGCAGGGCA AGAAGCCCAA GGCCAAGGAG CTCGCGGGCC TCAAGGCCGA CGGCCAGACC
CTGACCATCG AGCTGGACAA GCCGTGGGCC GGTTTCTCCA CCATGCTGGG CTACACCGCC
TTCCTGCCGA TGGCCGAGGA GTGTGTCAAG GACACCAAGG CCTGCAACGA CAAGCCCATC
GGCAACGGCC CGATGAAGTT CGATGGCAAG TGGAAGCACA AGGAATCCAT CTCGCTGGTC
AAGAACGACA AGTACAAGGG CCCGAACAAG GTCAAGTACG ACAAGGCCGA CTTCACCATC
TACGACGGTG ACGCGAACTG CTGGGCCGAC TTCGAGTCCG AGGACATCGA CATCTGCTCC
CCGCCGGTCG AGAAGTGGGA AGGTGCCAAG AACGACCCGG ACCTGAAGGA ACGCCTGATC
TCCGAGCCGA GCACCTCGCT CACTTACCTG GGTATGCCGG TCTACGACAA GGCCTTCCAG
GACAAGAAGC TCCGTCAGGC CTTCAGCCTG GCGATCGACC GCGAGAAGGT CATCGACATC
GCCACCCCCG GCCGCGCGGT TCCGGCGACC GAGTTCGCTC CGCCGTCGCT GCCCGGTGGC
GTCGAAGGCG CCTGCGAGTT CTGCAAGTTT GACGCCAAGG AAGCCAAGAA GCTCTTCGAG
GAGTCGAGCT GGCCCAAGGG CAAGAAGATG GAGATCTGGT ACAACGCCGA CCCGACCAAC
AAGGCGATCT TCGAGGCCGT CGGTAACCAG CTGAAGAAGA ACCTGGGCGT GGAGTTCGAG
CTCGTCGTGA AGGACTGGGA CCCGTTCCTG GCTGCCACCG ACAAGCACAA GGCCAAGGGT
CCGTTCCGTA TGGGCTGGCT GCCGGACTAC CCGCTCAACG AGAACTACCT GAAGCCGATC
TACGGCAACG GCGCGGCGAA CAACCGCTTC GGTTACGAAG GCAAGGACTT CAACAAGAAG
CTCGCCGAAG CCGACGCCGC CAAGACCCTC GAAGAGGGTA TGGAGATCTA CGCCGAAGCC
GAGAAGATCC TGGCCGAGGA AATGATCGGT ATCCCGCTCT CCTGGGCGAA GTCGAGCAGC
TTCTACAGCG AGAACATCGA CCCGAAGAGC GTGAAGTACT TCCCGGTTGG CCAGATCCAG
TACGACAAGC TGGCTCCCGC CTAA
 
Protein sequence
MRAKPLIALA AAAVLTASLA ACGGGGGESA SGDMKFGLGA EPPSLIPPNT TESEGSIILS 
LTYAGLFDYN EKAEAENLLA KSIETKDNKT WTIEIEDGHK FHNGEPVDAE AFKRAWDWGA
YAPNAATGSY FFDRIKGFAD MQGKKPKAKE LAGLKADGQT LTIELDKPWA GFSTMLGYTA
FLPMAEECVK DTKACNDKPI GNGPMKFDGK WKHKESISLV KNDKYKGPNK VKYDKADFTI
YDGDANCWAD FESEDIDICS PPVEKWEGAK NDPDLKERLI SEPSTSLTYL GMPVYDKAFQ
DKKLRQAFSL AIDREKVIDI ATPGRAVPAT EFAPPSLPGG VEGACEFCKF DAKEAKKLFE
ESSWPKGKKM EIWYNADPTN KAIFEAVGNQ LKKNLGVEFE LVVKDWDPFL AATDKHKAKG
PFRMGWLPDY PLNENYLKPI YGNGAANNRF GYEGKDFNKK LAEADAAKTL EEGMEIYAEA
EKILAEEMIG IPLSWAKSSS FYSENIDPKS VKYFPVGQIQ YDKLAPA