Gene Sare_3504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3504 
Symbol 
ID5703313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4042828 
End bp4044234 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content69% 
IMG OID641272931 
Productflavin-containing monooxygenase FMO 
Protein accessionYP_001538297 
Protein GI159039044 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2072] Predicted flavoprotein involved in K+ transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00172628 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCACCT CCACCGATCA CGGGCGGTCC GACCTGGAGT CGAGCAGCCT TGCCCTGACC 
CGCGACGGCC GTTCGGTCTC CGACCGCGGT GACACCGTCT GCGTGATCGG GGCGGGTGCG
AGCGGGCTGA CCGCGATCAA GAATCTGACC GAGCACGGGT TCGGCGTCGA CTGCTACGAG
CGGGAGACCG GAGTCGGCGG CGCGTGGAAC TGGCGACACG ACCGCAGCCC GGTGTACGCC
AGCACCCACC TGATCTCGTC GCGTCCATTC ACCCAGTTTC CCGACTTCCC GATGCCGGAC
GACTGGCCGG ACTACCCGCA TCACAGCCAG TTGTTGTCCT ATCTTGAGCG GTACGCGGAA
CACTTCGACC TGCGCCGGCA CGTCTGGTTC GGCACCGAGG TGGTGCGGGT CGAGCCGGCT
GACGGCGACC GGTGGGACGT CACGACCCGC AGTACCGGCG GCTACGGCCC GGAACGCACC
TCCCGGTACG CCGCGGTCGT GATCGCCAAT GGTCACAACT GGTCGCCGAA GCTGCCCGAC
TACGAAGGGC TCGCCGAGTT CCGGGGCGAG GCCATGCACG CCTCGTCCTA CCAGGACCCG
GCGCAGCTGC GGGGCAAGCG GGTGCTGGTG GTGGGTGCCG GCAACACCGG CTGCGACATC
GCCGTCGAGG CCGCGCAGCA GGCGTCGCGC TGCTGGCACG CCACCCGTCG CAGCTACTGG
TACGCGCCGA AGTACGTCCT GGGTCGTCCA GTCGATCAGA TCAACGACGT GCTGCTGGCG
CTGCGGGTGC CCCGGCGGGT CCGACAGTGG CTCTACCACC TCACCCTGCG GCTCACGGTG
GGGGATCTGA CCCGGTTTGG GCTGGCGCGG CCTGACCACA GGATGCTCGA GACACATCCG
ATCGTCAACA GTCAGCTCGT CCACTATCTG GGCCACGGCC GGATCACGCC GGTGCCGGAC
CCCGTCCGTT TCCACCCGCA CTCCGTTGAG CTGGCTGACG GTCGCCGGAT CGATCCGGAA
CTGGTGGTGT TCGCCACCGG CTACTTACCC CGGTTCGACT TCCTCGATCC GAAGATTCTC
GGCGACGACG GCACGGTCGG GCGGCCGGTG TTGTGGCTCA ACGCCTTCGC GCCGAATCAC
CCAACCCTCG CCGTGGCCGG GCTGGTGCAG CCCGACTCGG GCATGTTCCC GCTGTCGCAT
TGGCAGACCG TGCTCTTTGC CCGCCTGCTG CGATCACGCG TGACCCGGCC CGGCCGGGCG
GCGGGCTTCG CCGCCGCGGT GGTTGCCCGG GCGGGGGAGC GCTACGCGGG ACCGGTCAGG
GACAGCAGCC GGCACTGGTT CGAGGTTGGT CACGTCGACT ACCTGCGCGC TCTCCAGCGC
GCCCTGCACG ACCTGGAGGC CAAGTGA
 
Protein sequence
MTTSTDHGRS DLESSSLALT RDGRSVSDRG DTVCVIGAGA SGLTAIKNLT EHGFGVDCYE 
RETGVGGAWN WRHDRSPVYA STHLISSRPF TQFPDFPMPD DWPDYPHHSQ LLSYLERYAE
HFDLRRHVWF GTEVVRVEPA DGDRWDVTTR STGGYGPERT SRYAAVVIAN GHNWSPKLPD
YEGLAEFRGE AMHASSYQDP AQLRGKRVLV VGAGNTGCDI AVEAAQQASR CWHATRRSYW
YAPKYVLGRP VDQINDVLLA LRVPRRVRQW LYHLTLRLTV GDLTRFGLAR PDHRMLETHP
IVNSQLVHYL GHGRITPVPD PVRFHPHSVE LADGRRIDPE LVVFATGYLP RFDFLDPKIL
GDDGTVGRPV LWLNAFAPNH PTLAVAGLVQ PDSGMFPLSH WQTVLFARLL RSRVTRPGRA
AGFAAAVVAR AGERYAGPVR DSSRHWFEVG HVDYLRALQR ALHDLEAK