Gene Snas_4944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4944 
Symbol 
ID8886151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5252765 
End bp5254132 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content67% 
IMG OID 
Productflavin-containing monooxygenase FMO 
Protein accessionYP_003513678 
Protein GI291302400 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.310384 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGAGT ACGGCGTGGA CGCCGCATAC GATCGCGGGG ACGCGGTGTG TGTCATCGGT 
GCCGGGATGG CGGGCCTGGT GGCGGTCAAG AACCTGCGCG AACACGGCTT CAATGTGGAC
TGTTACGAGC AGGAGACCGA GATCGGCGGC TCCTGGAACA TCAAGAAGCG CCGCAGCCCC
ACCTACGCCA ACACCCACTT GGTCTCGTCG CGGACCCAGA CCGAGTTCCC CGACTTCCCG
ATGCCCGACG ACTGGCCGGA CTATCCACAC CACAGCAAGG TGCTGTCCTA CCTGGAGAGC
TACGCCGACC ACTTCGGACT GCGCGAGCAC ATCTGGTTTG GCAGCGAGAT CGAGCGCATC
GAGAACGCCG AACGCGGCCG CTTCGACGTC GTCGTCAAAC CGATGTCCGG CAGCGCCGCC
CGCAGACTGC GCTACGCCGC CGTCGTCATC GCCAACGGGC ACAACTGGGA CCCGTTCCTG
CCGGAGTACC CCGGCCAGCA GGCCTATCGC GGCGAGATCA TCCACTCGGT GTCCTACCAG
GACTCGTCGC AGCTGCGCGG CAAGAAGGTG CTGATCGTCG GCGCCGGGAA CTCCGGCTGC
GACATCGCCG GCGAATCGGC GATCACCGCC AAACGGACCT GGCAGTCCAC CCGGCGCGGC
TACTGGTACA CGCCCAAGTA CATGCTCGGA CTGCCCGCCG ACAAGACCGC GCAGCGCCTG
TCGTGGCTGC CCAAAGGCTT GCGGCGCAAG GTGACCGAGT ACGCGATCAA GAAGATCGGC
GGCGACCCGG TCCGGTTCGG ACTGCCCGCC CCCGACCACC GTTTCGGACA GTCGCACCCC
ATCGTCAACA GCCACATCCT GCACCACATC GGACACGGGG CCCTGGAGCC CAAACCCGAC
ATCGCGCGCT TCGACGGTCG CAAGGTGGTG TTCACCGACG AGTCCACCAT CGAGCCCGAC
CTCGTCGTCA TGGCCACCGG CTACCGTCCC CGCTACGACT TCTGCGACGA CGAACTGCTG
GGCGCCGGAC GCGAGACCGG CGGCTTCCCG CGACTGTTCG CGCAGATGTT CTCGCCCGCG
TCGGAGACGC TGTTCGTGGC GGGGCTGTTG CAGGCCGACG TCGGCATCTT CCCGCTGGTG
CACTGGCAGA CGGTGGCCAT CGCGAAGTGG CTGCACGTGC GGGTGTCCGA CCCGGAACGG
GCCAAGGCCT TTCGCGGCCA GGTCGTCACC GAGGCCGGAA ACCGTTACGT GGACGCGGAG
ATGAACGACT CCGACCGGCA CCGGCTCGAG GTCTCGCACG ACCGTTACCT CGACTCGGTG
GCTCGCATCA TTGAGACCCT TGACAAGGAA TCGGACGGTG CCCGATGA
 
Protein sequence
MSEYGVDAAY DRGDAVCVIG AGMAGLVAVK NLREHGFNVD CYEQETEIGG SWNIKKRRSP 
TYANTHLVSS RTQTEFPDFP MPDDWPDYPH HSKVLSYLES YADHFGLREH IWFGSEIERI
ENAERGRFDV VVKPMSGSAA RRLRYAAVVI ANGHNWDPFL PEYPGQQAYR GEIIHSVSYQ
DSSQLRGKKV LIVGAGNSGC DIAGESAITA KRTWQSTRRG YWYTPKYMLG LPADKTAQRL
SWLPKGLRRK VTEYAIKKIG GDPVRFGLPA PDHRFGQSHP IVNSHILHHI GHGALEPKPD
IARFDGRKVV FTDESTIEPD LVVMATGYRP RYDFCDDELL GAGRETGGFP RLFAQMFSPA
SETLFVAGLL QADVGIFPLV HWQTVAIAKW LHVRVSDPER AKAFRGQVVT EAGNRYVDAE
MNDSDRHRLE VSHDRYLDSV ARIIETLDKE SDGAR