Gene Sare_4721 details


Gene Information

Locus tag: Sare_4721
Symbol:
ID: 5706023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5344268
End bp: 5345425
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 68%
IMG OID: 641274119
Product: NADH dehydrogenase (quinone)
Protein accession: YP_001539465
Protein GI: 159040212
COG category: [C] Energy production and conversion
COG ID: [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
TIGRFAM ID:
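
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(5344268, 5345425)
print(length, protein_length(length))  # 1158 385
```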


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000278675
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACCACGG ACGCCGAGCT CCGCGAACTG ACCGTTGGTA CCGGCGCCGG TGGGGAGCAG 
CTGGGCACGG ACATGGTGCT CAACATTGGT CCACAGCACC CCTCGACGCA CGGCGTGCTC
CGGTTGCGGC TGGTACTCGA CGGCGAACGG GTGGTGAGCG CCGAGCCGAT CGTCGGGTAC
ATGCACCGGG GCGCGGAGAA ACTGTTCGAG GTCCGCGACT ACCGACAGAT CATCGTGCTG
GCGAACCGGC ACGACTGGCT CTCGGCGTTC GCCAACGAGC TCGGGGTGGT GCTCGCCGTG
GAGCGGCTGA TGGGCATGGA GGTTCCGGAG CGGGCCACCT GGCTCCGGAT GGCGCTCGCC
GAGCTGAACC GGGTACTCAA CCACCTGATG TTCCTCGGCT CGTACCCGCT GGAGATCGGC
GCTATCACCC CGATGTTCTA CGCATTTCGG GAACGGGAGA CGCTTCAGGC GGTGCTCGAG
GAGGTCTCCG GCGGGCGGAT CCACTACATG TTCAACCGGG TCGGTGGGCT CAAGGAGGAG
GTGCCGGCCG GCTGGACCGG TCGGGCCCGG ACGGCCATCG GCGAAGTACG GCGGCGCATG
CCCGACCTGG ATCGCCTGAT CCGGCGGAAC GAGATCTTCC TGGCCCGGAC CGTCGGCGTG
GGGGTGCTCT CGGCGGCCCA GGCCGCCGCG TTCGGCGCGT CCGGACCGGT CGCCCGGGCC
TCCGGCCTCG ACTTGGACCT ACGCCGGGAC GAGCCGTACC TGGCGTACGA CCAACTCGAG
GTGCCGGTGG TGACCCGCAC CGCCGGCGAC TGCCACTCCC GCTTCGAGGT GTTGCTCGAC
CAGGTGTATG TCTCGCTCGA CCTCGCCGAG CAGTGCCTGG ACCAGGTGGA CCGGCTCACC
GGACCGGTCA ACACCCGGCT ACCGAAGGTG CTCAAGGCGC CCGAGGGGCA CACCTACGCC
TGGACCGAGA ACCCGCTCGG AATCAACGGG TACTACCTGG TGTCCCGGGG TGAGAAGACA
CCGTGGCGGC TCAAGCTGCG CACCGCGTCG TACGCCAACG TGCAGGCGCT GGCCACACTG
CTGCCGGGTT GCCTGGTGCC GGACCTGATC GCCATCCTCG GCTCGATGTT CTTCGTGGTG
GGTGACATCG ACAAGTGA
 
Protein sequence
MTTDAELREL TVGTGAGGEQ LGTDMVLNIG PQHPSTHGVL RLRLVLDGER VVSAEPIVGY 
MHRGAEKLFE VRDYRQIIVL ANRHDWLSAF ANELGVVLAV ERLMGMEVPE RATWLRMALA
ELNRVLNHLM FLGSYPLEIG AITPMFYAFR ERETLQAVLE EVSGGRIHYM FNRVGGLKEE
VPAGWTGRAR TAIGEVRRRM PDLDRLIRRN EIFLARTVGV GVLSAAQAAA FGASGPVARA
SGLDLDLRRD EPYLAYDQLE VPVVTRTAGD CHSRFEVLLD QVYVSLDLAE QCLDQVDRLT
GPVNTRLPKV LKAPEGHTYA WTENPLGING YYLVSRGEKT PWRLKLRTAS YANVQALATL
LPGCLVPDLI AILGSMFFVV GDIDK
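
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own method: it strips the 10-base block spacing, computes the GC fraction, and translates codon by codon. It relies on the fact that translation table 11 uses the same amino acid assignments as the standard genetic code (the two differ only in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is used in the example.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "ATGACCACGG ACGCCGAGCT CCGCGAACTG ACCGTTGGTA CCGGCGCCGG TGGGGAGCAG"
print(translate(first_line))  # MTTDAELRELTVGTGAGGEQ
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields of the record.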