Gene Snas_4801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4801 
Symbol 
ID8886008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5096398 
End bp5097978 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content68% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003513535 
Protein GI291302257 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCGACT ACCCTGCCCT GGAAGGGTGC GAGCCAGGGG GATTCGACCG CGCGGCCCAG 
GAGGCGGGTG AACTGGCGCA GAACGTCGAG TCCATGATCG ACGAACTGGA GGCCCTGGTA
CCGGTACTGA GCGGTGCCGA CGACGACTTC GCGACCGTCA ACGCGATTCG GGACCGGGTC
GCCGCCGAGA CCGATCGGCT CCGGGAAGCG CAGCTGTCAC TGCGCACCAT CGAGACGTCG
CTGCTGAGCC TCGGCGAGGA CGTCCGGCGT GGCCGAATGG AACTCGAAGA GGCACGTAGG
CGCGCCGACA AGTTTGCCAA AGTGGACTAC TCGCTGGGTG TCATCACGCT CGACTTCGGC
AAGGTACCGC AGGCTGACGC GGACCCCAAG GGATTCGCCC GCTTCAAGGC CCAGCTGCGC
GATGTCGCCG ACCAGATGCG GCGCGCCCAC ACCCTGGCCA ACACCAGTGA CTATGACGCC
GCCTCCGACA TCGCCAAGAA GTCACCCGCG CTGACCGAGC CGGGCCCGAG CCGGATCCGC
AGCCCCTACG AGAACCACAG CTGGTGGACC GGCCGCTCGG CCGCCGAACG GGAACACCGG
ATCGAGACCG AACCCAACCT GGTCGGCGGA CCCGACAGTG ACGGCATCCC CAGCGCCGCG
CGCAGCGCGG CGAACGAACA GTTGCTGCGG CAACACATTC GCGAGCTGGA AGAGCGCGCC
GCGTCGGGCA GCGGCGCGGC GTCCCGACTG CTCGCGGTGA TGCTCGCCGT CCAGCACCGG
CTTGCCGACG ACGCCAGCGA CGGCACCCGG GAGCGGGCGT TCCTGCTGTT GGCCGAACCT
TCCGACCCGG GCCGCATCGT GCTGGCGATC GGCAATCCCG ACATCGCCGA CCACGTCATC
GTCCACATTC CCGGCACCGG TGCCTCGCTG CCCAAGCTCG CAGGGGAAAT CGCCCGCGTC
GAGCGGATGG TCGGCGACGC CAACCTCTAC GAGACCCGCC ACACCGCCGG GATCCTGTGG
CTGGGTTACG ACGCCCCGCC GAACCTGTTC GCGGCCGCGA CCAAGCGCTA TGCGCAGTCG
GCCACAAAGC CGTTGCCGAG GTTCCTGACC GGCATTCGGG CGGTGCGCAG ACGCGGCCTG
GACCGCTTCG GCGTGACCGT GATGGGACAC TCCTACGGCA CCGTCGCCAT CGGGTTCACC
GTCCGGGAAC AACAGGTGTC CGCCAGCCAG CTGATCCTGG TGGCCAGCCC GGGGGTCGGC
GTCAAGTCGG CGGACCAGCT GAAGATCGAC AAGGACGACG TCTACGCCAC CACGGCTCCC
AACGACATCA TCCACAGAGT TCCCAGAGTC CCCCATGGAA CCTCCCCGGT CCACAAATCC
TTCGGCGCCA AGGTGTTCCC GACCCCTCAC CACAACGGCA GCCAGATCGC CGCCCACAGC
GCATACTGGA AGAACGACAA CCCTGTGCGT CAGGACTTCG CCGCGATCAT CACCGGGCAC
GGCCACAAGG TCGTCGGCGG CACCGATGCC GATGTCCGTA TCCAGCCGGG GCCGTGGACG
AACGACGAGG GGGGCGAATG A
 
Protein sequence
MVDYPALEGC EPGGFDRAAQ EAGELAQNVE SMIDELEALV PVLSGADDDF ATVNAIRDRV 
AAETDRLREA QLSLRTIETS LLSLGEDVRR GRMELEEARR RADKFAKVDY SLGVITLDFG
KVPQADADPK GFARFKAQLR DVADQMRRAH TLANTSDYDA ASDIAKKSPA LTEPGPSRIR
SPYENHSWWT GRSAAEREHR IETEPNLVGG PDSDGIPSAA RSAANEQLLR QHIRELEERA
ASGSGAASRL LAVMLAVQHR LADDASDGTR ERAFLLLAEP SDPGRIVLAI GNPDIADHVI
VHIPGTGASL PKLAGEIARV ERMVGDANLY ETRHTAGILW LGYDAPPNLF AAATKRYAQS
ATKPLPRFLT GIRAVRRRGL DRFGVTVMGH SYGTVAIGFT VREQQVSASQ LILVASPGVG
VKSADQLKID KDDVYATTAP NDIIHRVPRV PHGTSPVHKS FGAKVFPTPH HNGSQIAAHS
AYWKNDNPVR QDFAAIITGH GHKVVGGTDA DVRIQPGPWT NDEGGE