Gene Sare_4460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4460 
Symbol 
ID5704951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5041079 
End bp5042404 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content67% 
IMG OID641273876 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_001539225 
Protein GI159039972 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCTT CCAACTACGC CACCGAGCGG GAGACCGCCG AGGGTAAGGT CTTCACCGTC 
ACCGGCGGCG ACTGGGACGT GGTCGTCTCC GGCACCGACC CGATCAACGA CGAGCGGATC
GTCGTCAACA TGGGCCCGCA GCACCCGTCC ACGCACGGGG TGCTCCGGCT GGTGCTGGAG
CTGGAGGGTG AGACGGTCCG CGAGGCCCGG TCGGTCGTCG GCTACCTGCA CACCGGGATC
GAGAAGAACC TGGAGTTCCG TAACTGGGTG CAGGGCTCGA CCTTCGTGAC CCGGATGGAC
TACCTCGCCC CGCTGTTCAA CGAGACGGCG TACGCGTTGG CGGTGGAGAA GCTGCTCGGC
ATCGAGGAGC AGATCACCGA ACGGGCCACC ACCATCCGGG TCCTGATGAT GGAGCTCAAC
CGGATCTCCT CGCACCTGGT CTGGGTCGCC ACCACGGCCA TGGAGCTGGG TGCGATCAAC
ATGATGCTGT ACGGCTTCCG GGAGCGGGAG TACATCCTGG AGATCTTCGA GCTGATCACC
GGGCTGCGGA TGAACCACGC GTACGTCCGC CCGGGTGGGG TGGCTCAGGA CGTGCCGGAC
GAGGCGATCG CCAAGATCCG CGACTTCCTG AAGCTGATGC CGAAGAAGCT CGAGGAGTAC
GAGAAGATGC TCTCCGGCCA GCCGATCTGG CTGGAGCGTA CGCAGAACGT CGGGGTGCTC
GACGCGACCG GTTGCCTCGC GCTCGGCGTG ACCGGACCGG TGCTGCGCTC CGCCGGCCTC
GCCTGGGACC TGCGCAAGAC CATGCCGTAC TGCGGCTACG AGACGTACGA GTTCGACGTG
CCGACCCACA CCGATGGTGA CGTGTGGGGC CGGTATCTGG TTCGGCTCGC CGAGATCCGG
GAGTCGTTGA AGCTCGTCGA GCAGGCGGTG GACCGGCTTC GGCCGGGTCC GGTGATGGTG
GCGGATCGGA AGATCGCCTG GCCGGCGCAG CTCGCCATCG GGGTCGACGG CATGGGCAAC
TCACTGGAGC ACGTAGCGAA GATCATGGGG CAGTCGATGG AGTCGCTGAT CCATCACTTC
AAGCTCGTCA CCGAGGGCTT CCGGGTTCCA CCCGGCCAGG TGTACGTGGC CCTCGAGGCG
CCCCGGGGCG AGCTGGGCGT GCACGCGGTC TCCGACGGGG GGACCCGCCC GTACCGGGTG
CACTACCGGG AGCCGAGCTT CGTCAACCTC CAGGCCCTGC CGGCGATGGC CGAGGGCGGC
CTGATCGCCG ACGTGATCGC GGGTGGCGCC TCGCTGGACC CGGTGATGGG TGGGTGTGAC
CGGTGA
 
Protein sequence
MSASNYATER ETAEGKVFTV TGGDWDVVVS GTDPINDERI VVNMGPQHPS THGVLRLVLE 
LEGETVREAR SVVGYLHTGI EKNLEFRNWV QGSTFVTRMD YLAPLFNETA YALAVEKLLG
IEEQITERAT TIRVLMMELN RISSHLVWVA TTAMELGAIN MMLYGFRERE YILEIFELIT
GLRMNHAYVR PGGVAQDVPD EAIAKIRDFL KLMPKKLEEY EKMLSGQPIW LERTQNVGVL
DATGCLALGV TGPVLRSAGL AWDLRKTMPY CGYETYEFDV PTHTDGDVWG RYLVRLAEIR
ESLKLVEQAV DRLRPGPVMV ADRKIAWPAQ LAIGVDGMGN SLEHVAKIMG QSMESLIHHF
KLVTEGFRVP PGQVYVALEA PRGELGVHAV SDGGTRPYRV HYREPSFVNL QALPAMAEGG
LIADVIAGGA SLDPVMGGCD R