Gene Sare_0472 details

Gene Information

Locus tag: Sare_0472
Symbol:
ID: 5703651
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 540234
End bp: 541745
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 72%
IMG OID: 641269997
Product: proton-translocating NADH-quinone oxidoreductase, chain M
Protein accession: YP_001535392
Protein GI: 159036139
COG category: [C] Energy production and conversion
COG ID: [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M)
TIGRFAM ID: [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M
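
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 540234
end_bp = 541745

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the Gene Length (1512 bp) and Protein Length (503 aa) fields.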


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.477344
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGGA TCGTGGGGCA GGTCCTGCTG GTCGCGGTGC TGGCAGTGCC GACGGTCGGT 
GCGGCGGTCG TCGTCGCGCT TCGACACGAC CGGGCGGCCC GCCTGGTCGG CACGGTGGCC
GCCGGGCTGA CCCTGCTCGC CACGCTGCCG CTGGTCGCCG GGCATGACGA CAGCGGCGTG
GGCACCGACT CGACACCCGC GGTACGACCC TGGCACCAGG TGGACCTGCC CTGGGTGCCC
GGCCTGGACC TGCGCTTCCA CCTCGGCGTC GACGGCATCT CCTGGCCGCT GGTGGTGCTG
ACCGCGCTGC TGACCCTGCT CTGCTGCGGC TACACGCTGG GGAGGGTACC CAGCGGGGGC
AGCGGTCGAG CCCTGGTGGC GTTGCTGCTG CTGGTCGAAG TGGGCATCCT CGGCACCTTC
CTCGCGCTCG ACCTGGTGCT CTTCTTCGTC TTCTTCGAGG TCGTCCTCCT GCCGATGTAC
GCGATCATCG CCGGCTGGGG CGGGCCCGAC CGGCACCGGG CGGCCCGTAA GTTCGCCCTC
TACACACTGT TCGGCTCAGT GCTGCTGCTG GTGGGGGTAC TGGTGGTGGT GACCACTGCC
GGCACCGCGG ACGTCGTGGC GCTGACCGGC GGCACCGGAC TCTCCCGCGG CCCGCAACTC
GCCGCGTTCA CCCTGCTGGC ACTCGCCTTC GCGGTGAAGA GCCCACTGTG GCCACTGCAC
TCCTGGCTGC CCGACGCGCA CACCCAGGCA CCGACCGTGG GCAGCGTGAT CCTCGCCGGA
GTGCTGCTCA AGATGGGCAC GTACGGGCTG ATCCGGATCG CGGTCGGTGT CGCCCCCGAG
GGCGCCGACT GGGCCGCGCC GGTGCTCGGT GTGCTCGCCG TCGCGGCGAT CCTGGTCGGA
TCCCTGGTCT GCCTGGCGCA GACCGAGCTG AAGCGGCTGA TCGCGTACTC CAGCGTGGGG
CACATGGGTT TCGTGCTCCT CGGCGTCGCC ACGCTCACCG GTACCGGGCT TCAGGCGGCC
CTGATCGGCA ACGTCGCGCA CGGGATCATC ACCGGCCTGC TGTTCTTCCT CGCCGGCGCC
GTGAAGGACC GGGCGCACAC CGGTGATCTG GTCGACCTGT CCGGTTTACG GGAGACCGCA
CCCCGGCTGG CCGGGGTGCT CGGCTTCGCC GCCGTCGCCT CACTGGGCCT GCCTGGCCTG
GCCGGCTTCT GGGGGGAGGC GTTCGCCGTG GTCGCCGCGG TCCGCGTCGG TGGTCCCCTC
TGGCTGACCC TCGCCGTGCT CGCGGCGCTC GGCGGCGCGC TGACCGCCGC GTACCTCCTC
CGGCTGCTCC GCCAGGTCAC CCACGGCCGG CCCAGCCCAG CGGTGGCGTC GGTCAGGCCC
GGTGTGGCGG GGGTGGAACT GGTCACCTGG GCGCCACTGG TGTTGCTCAC GCTCGCCGTC
GGACTGGCCC CGATTCTGGT CCTCGGCGTG GCCCACGCAC CGGTCGACGC GCTGCTGGCG
GGTCTGCCAT GA
 
Protein sequence
MSGIVGQVLL VAVLAVPTVG AAVVVALRHD RAARLVGTVA AGLTLLATLP LVAGHDDSGV 
GTDSTPAVRP WHQVDLPWVP GLDLRFHLGV DGISWPLVVL TALLTLLCCG YTLGRVPSGG
SGRALVALLL LVEVGILGTF LALDLVLFFV FFEVVLLPMY AIIAGWGGPD RHRAARKFAL
YTLFGSVLLL VGVLVVVTTA GTADVVALTG GTGLSRGPQL AAFTLLALAF AVKSPLWPLH
SWLPDAHTQA PTVGSVILAG VLLKMGTYGL IRIAVGVAPE GADWAAPVLG VLAVAAILVG
SLVCLAQTEL KRLIAYSSVG HMGFVLLGVA TLTGTGLQAA LIGNVAHGII TGLLFFLAGA
VKDRAHTGDL VDLSGLRETA PRLAGVLGFA AVASLGLPGL AGFWGEAFAV VAAVRVGGPL
WLTLAVLAAL GGALTAAYLL RLLRQVTHGR PSPAVASVRP GVAGVELVTW APLVLLTLAV
GLAPILVLGV AHAPVDALLA GLP
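
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be checked against the record, the following strips the block spacing, computes GC content, and translates the coding sequence. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one, which is assumed to be adequate here because translation table 11 uses the same codon-to-amino-acid assignments and differs only in its permitted start codons.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G).
# Assumption: sufficient for translation table 11, which shares
# these assignments and differs only in allowed start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks; upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = ("ATGAGCGGGA TCGTGGGGCA GGTCCTGCTG "
              "GTCGCGGTGC TGGCAGTGCC GACGGTCGGT")
print(translate(first_line))
print(round(gc_percent(first_line), 1))
```

Translating the first 60 bases gives the first 20 residues of the protein sequence (MSGIVGQVLL VAVLAVPTVG). Pasting the full gene sequence into `translate` and `gc_percent` allows the same comparison against the Protein sequence and GC content fields.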