Gene Sare_4458 details

Gene Information

Locus tag: Sare_4458
Symbol:
ID: 5704949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5038789
End bp: 5040120
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 69%
IMG OID: 641273874
Product: NADH-quinone oxidoreductase, F subunit
Protein accession: YP_001539223
Protein GI: 159039970
COG category: [C] Energy production and conversion
COG ID: [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
TIGRFAM ID: [TIGR01959] NADH-quinone oxidoreductase, F subunit
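
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not count toward the protein length. The variable names are hypothetical; the numbers are copied from the record.

```python
# Values copied from the gene record (variable names are illustrative).
start_bp = 5038789
end_bp = 5040120
gene_length_bp = 1332
protein_length_aa = 443

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bases each).
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # prints: 1332 0 443
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.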


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCACTC CTCGGCCGGA GACGCTGGCC AAGCTCACGC CGGTGCTCAC CAAGCGCTGG 
CTGTCGCCGG ATGCCTGGCG GATCGGCACC TACGAGCGGC TGGACGGGTA CGCGGCGCTG
CGCAAGGCGC TGGGTGCCCA CCCGGACGAC CTGATCCAGC TGGTCAAGGA CTCCGGGCTG
CGTGGCCGTG GTGGCGCGGG CTTCCCCACC GGTCTCAAGT GGGGGTTCAT CCCGCAGGAA
CCGGATGGCA GGTCGCCGAC GCGACCGCAC TACCTGGTGG TCAACGCCGA CGAGGGTGAG
CCGGGTACCT GCAAGGACCT ACCGCTGATG ACGCACGACC CGCACTCCCT GATCGAGGGT
GTGATCATCG CGTCGTACGC GATCCGGGCC AACCGCGCCT ACATCTACAT CCGTGGCGAG
GCGGTGCACG CCGCGCGTCG GCTGCGCCAC GCGGTGCAGG AGGCGTACGC CCGCGGTTAC
CTCGGAAGGA ACATCCTGGG CAGCGGGTTC GACCTGGACC TGGTGGTGCA CTCGGGAGCC
GGGGCCTACA TCTGTGGCGA GGAGACCGCG CTGCTGGACT CGCTGGAGGG GTTCCGCGGT
CAGCCCCGGC TGCGCCCACC GTTCCCGGCG ACCCACGGCC TGTACGCCAG CCCGACGGTG
GTGAACAACG TCGGCACCAT CGCTTCCGTG CCGTACATCG TCTCGGGCGG TGCGGACTGG
TGGCGGACCA TGGGCACGGA GAAGTCCTCC GGGCCGATGA TCTATTCGCT CTCCGGTCGA
GTCGTCAACC CGGGCCAGTA CGAGTGCTCG ATGGGGATCA CGCTCCGCGA GCTGATCGAG
CTGGCTGGCG GCATGCAGCC GGGCCACTCC CTGCGGTTCT GGACGCCGGG CGGTTCGTCG
ACGCCGCTGC TCACCGCCGA GCACCTGGAC GTGCCGTTGG ACTTCGAGGA GGTGGCGGCG
GCGGGGTCGA TCCTCGGTAC CACGGCGACG CAGATCTTCT CCGACCAGGA CTGCCCGGTG
TACGCGACGT ACCGGTGGCT GGAGTTCTAC CACCACGAGT CGTGCGGCAA GTGCACCCCG
TGCCGCGAGG GCAACTACTG GATGGTCCGG GTCTACCGCC GGATCCTCTC CGGCCAGGGC
ACCCAGGAGG ACCTGGACAC GCTCCTGGAC ACCTGCGACA ACATCCTCGG CCGCTCGTTC
TGCGGCCTCG GCGACGGCGC CACCAGCCCG GTGACCTCCT CCCTGAAGTA CTTCAAGCAG
GACTACCTCG ACTACATCGA GGGACGGACC GCGCCGAAGC TCTCCGACAA GCAGCTGGTG
GGGGCCCACT GA
 
Protein sequence
MTTPRPETLA KLTPVLTKRW LSPDAWRIGT YERLDGYAAL RKALGAHPDD LIQLVKDSGL 
RGRGGAGFPT GLKWGFIPQE PDGRSPTRPH YLVVNADEGE PGTCKDLPLM THDPHSLIEG
VIIASYAIRA NRAYIYIRGE AVHAARRLRH AVQEAYARGY LGRNILGSGF DLDLVVHSGA
GAYICGEETA LLDSLEGFRG QPRLRPPFPA THGLYASPTV VNNVGTIASV PYIVSGGADW
WRTMGTEKSS GPMIYSLSGR VVNPGQYECS MGITLRELIE LAGGMQPGHS LRFWTPGGSS
TPLLTAEHLD VPLDFEEVAA AGSILGTTAT QIFSDQDCPV YATYRWLEFY HHESCGKCTP
CREGNYWMVR VYRRILSGQG TQEDLDTLLD TCDNILGRSF CGLGDGATSP VTSSLKYFKQ
DYLDYIEGRT APKLSDKQLV GAH
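
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a minimal example, not the database's own pipeline. It assumes translation table 11 uses the standard codon-to-amino-acid assignments and that an alternative start codon such as GTG at the first position is read as methionine. The start codon set used here (ATG, GTG, TTG) is a common subset chosen for illustration, and the function and variable names are hypothetical. Only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: an illustrative subset of bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # A start codon at the first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
first_line = "GTGACCACTC CTCGGCCGGA GACGCTGGCC AAGCTCACGC CGGTGCTCAC CAAGCGCTGG"
last_line = "GGGGCCCACT GA"

print(translate(first_line))  # prints: MTTPRPETLAKLTPVLTKRW
print(translate(last_line))   # prints: GAH
```

The first output matches the first 20 residues of the listed protein, including the initial M read from the GTG start codon, and the second matches its final three residues before the TGA stop codon.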