Gene Snas_5603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5603 
Symbol 
ID8886818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5954842 
End bp5956023 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content62% 
IMG OID 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_003514326 
Protein GI291303048 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.442467 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATTC GAATCGCTTC TGCCACGGCC ATTGTGGCCG CGACACTGCT GTTGACGGGC 
TGCGTGGAAA GCAGTACCGG AAACGATAAA GCGGAAGGCA ATCTCGTAGG CAAGGGCGAA
GGCGCGAAAT GCCAGATCGA GGATCCCGTC AAAGTGGGCG CTGTATTCAG TCTCACCGAG
GCGGCTGCCT TCGCCGGCAC ATACCAGAAA GAGGGCCTGG AACTGGCCTT CAAAGAACTG
AACAAAAAGG ATGGGGTCGA ATACGAACTC ATCCAGGAAG ACGACAAGAC GGACGTCAAG
GCGAGCCTCG CCGCTTTCGA GAAGCTCATC GACCGTGAGA AGGTAAGCGC GATCGTCGGC
CCCACACTGT CCAACTTGGC CTTCAAGACC TATCAGGACG CCGAGCAGGC GGGCGTCCCC
GCCCTCGGAG TGTCCACCAC CGCCGAAGGC ATCCCCGACA TCGGTGACTA CATCTTCCGG
AATTCGCTGC CCGAGCAGTA CGCCCTCGCC GCCAGCATCC CCGCGGCGAA GGACGCGCTC
GACCTCAAAA AGGTCGCCGT CCTCCACGAC GAGCAGGACG AATTCACCGC CTCGGCCTAC
AAGACCATGA AGGAAACCCT CAAAGAGGCG GAGATCGACA TCGTCGCCGA CGAGCAGTTC
CAGACCACCG ACACCGAATT CCGGTCGCAA CTGACCAACG TCAAGAGCGC CAAACCCGAC
GCGCTCGTCC TGTCCGCCCT GCCCGGTGCC ACCATCCCGC TGGTCAAACA GGCCCGCGAA
CTCGGCATCG ACGCACCCAT CGTGGGCGGA AACGCCTTCA ACTCGCCCGT GCTGATCAAG
CAGCTCGAGG ACGCCGCCGA GGGACTCATC GTCGGTGGCG CCTGGAGTGC CAAGACCGAG
ACCCCCGGCA ACGCCGAATT CATCAAGGCC TACAAGAAGG CCTACGACAA GGATCCCGAC
CAGTTCGCCG CCCAGGCGTA CACCGCCGGA TTCCTCATCG ATGAGGCCGT GCGCACCGAC
TGCGACGGCA ACCGGGACGC CATAAAGGAC AATCTCGGCC AGATCCTGAA GTTCGACACC
GTCCTGGGCA AGCTGTCCCT GGACGAGACC GGCGAGGTCC ACCAGGAACC GGTCGTCCAG
ATCGTCGAGG ACGGCAAGCT CACCCCGCTC AAGAAGAAAT AG
 
Protein sequence
MRIRIASATA IVAATLLLTG CVESSTGNDK AEGNLVGKGE GAKCQIEDPV KVGAVFSLTE 
AAAFAGTYQK EGLELAFKEL NKKDGVEYEL IQEDDKTDVK ASLAAFEKLI DREKVSAIVG
PTLSNLAFKT YQDAEQAGVP ALGVSTTAEG IPDIGDYIFR NSLPEQYALA ASIPAAKDAL
DLKKVAVLHD EQDEFTASAY KTMKETLKEA EIDIVADEQF QTTDTEFRSQ LTNVKSAKPD
ALVLSALPGA TIPLVKQARE LGIDAPIVGG NAFNSPVLIK QLEDAAEGLI VGGAWSAKTE
TPGNAEFIKA YKKAYDKDPD QFAAQAYTAG FLIDEAVRTD CDGNRDAIKD NLGQILKFDT
VLGKLSLDET GEVHQEPVVQ IVEDGKLTPL KKK