Gene Sare_4456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4456 
Symbol 
ID5704947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5034758 
End bp5036104 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content68% 
IMG OID641273872 
ProductNADH dehydrogenase subunit H 
Protein accessionYP_001539221 
Protein GI159039968 
COG category[C] Energy production and conversion 
COG ID[COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.498468 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCTTCC TCGCCCAGGA ACCGACGCTG GCCGATTTCG GCCGGGATCC GTGGTGGCTG 
ATCCTGATCA AGGTCGTCTT CGCGTTCGCG TTCGGCCTGG TGGCCACGCT GCTCGGGGTC
TGGTTCGAAC GGCGGGTCGT CGGCCGGATG GCGGTACGGC CCGGCCCCAA CCAGCTCGGC
CCGTTCGGCC TGCTCCAGAC GCTCGCGGAC GGCGTGAAGA TGGCCTTCAA GGAGGACATC
CTCCCGCGGT CCGCGGACAA GGTCGTCTAC TTCTTCGCCC CGGTCATCTC GGTGGTCTGT
GCGGTCACCG CGCTGTCGGT GATGCCGTTC GGCCCGATGG TCAGCATCTT CGGGCACCAG
ACGCCGTTGC AGGTCACCGA CGTGTCGGTG GCGGTGCTGC TGGTGCTGGC CTGCTCGTCG
ATGGCGGTGT ACGGCGTGGT GCTGGCCGGC TGGGCCTCCG GGTCGACCTA CCCACTGCTC
GGTGGTCTGC GGTCCAGCGC GCAGCTGATC TCGTATGAGA TCGCGCTGGG GCTCTCCGTC
GTGGCGGTGT TCATGCTCTC GGGCACGATG TCGACCAGCG GGATCGTCGC CGCCCAGGGG
GAGCGGCCCC AGGTCGAGTT CTTCGGTCTC GACGTCTCGG CTCCCGGCTG GTACGCGATC
CTGCTCTTCC CGAGCTTCGT CATCTTCTTC ATCGCCATCG TCGGCGAGAC CAACCGAGCC
CCGTTCGACC TGCCCGAGGC GGAGTCCGAG CTGGTCGCCG GCTTCATGAC GGAGTACAGC
TCGCTGAAGT TCGCGCTCAT CATGCTCTCC GAGTACGTCG CGATGGTGAC CATGTCGGCG
TTCACCGTGA CGTTGTTCCT CGGCGGCTGG CGCGCACCCT GGCCGCTGAG CATCTGGGAC
GGGGCAAACT CCGGTTGGTG GCCGATGCTG TGGTTCTTCG GCAAGGTGCT CGCCCTCGTC
TTCGTCTTCG TCTGGCTGCG GGGCACCCTG CCCCGGCTGC GCTACGACCA GTTCATGCGC
CTCGGCTGGA AGGTCCTGCT CCCGCTCAAC CTGCTGTGGA TCCTGGTGCT GGCCGGGTGG
CTGAAGACCC AGGGCTGGGA GCGCGCCGAC CGGCTGATCG CGTACGGGGC CGTCGCCGGG
GTGGTGCTGA TCGTCACGCT GATCTGGCCG AGCCGCAAGC CGGCAGCGAA GCCGACGCTG
GCCGAGGAGG TCAGCAACCG GCCCTATGGC AGCTTCCCGC TGCCGCCGCT GGACCTTCAG
GTACCACCGA GCCCGCGAAC CCAGCGCATC GTTGCCGAGC GGGAGCCGGC CAACCTCACC
ACCGGCACGG ATTCCAGGGA GGTGTGA
 
Protein sequence
MTFLAQEPTL ADFGRDPWWL ILIKVVFAFA FGLVATLLGV WFERRVVGRM AVRPGPNQLG 
PFGLLQTLAD GVKMAFKEDI LPRSADKVVY FFAPVISVVC AVTALSVMPF GPMVSIFGHQ
TPLQVTDVSV AVLLVLACSS MAVYGVVLAG WASGSTYPLL GGLRSSAQLI SYEIALGLSV
VAVFMLSGTM STSGIVAAQG ERPQVEFFGL DVSAPGWYAI LLFPSFVIFF IAIVGETNRA
PFDLPEAESE LVAGFMTEYS SLKFALIMLS EYVAMVTMSA FTVTLFLGGW RAPWPLSIWD
GANSGWWPML WFFGKVLALV FVFVWLRGTL PRLRYDQFMR LGWKVLLPLN LLWILVLAGW
LKTQGWERAD RLIAYGAVAG VVLIVTLIWP SRKPAAKPTL AEEVSNRPYG SFPLPPLDLQ
VPPSPRTQRI VAEREPANLT TGTDSREV