Gene Sare_4451 details

Gene Information

Locus tag: Sare_4451
Symbol:
ID: 5704942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5029456
End bp: 5030988
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 66%
IMG OID: 641273867
Product: proton-translocating NADH-quinone oxidoreductase, chain M
Protein accession: YP_001539216
Protein GI: 159039963
COG category: [C] Energy production and conversion
COG ID: [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M)
TIGRFAM ID: [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M
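
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 5029456
END_BP = 5030988
GENE_LENGTH_BP = 1533
PROTEIN_LENGTH_AA = 510


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 5030988 - 5029456 + 1 = 1533 bp, and 1533 / 3 = 511 codons, of which one is the stop codon, leaving 510 amino acids.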


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0655371
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGACT TCCCGTTTCT TTCGGTGCTC ACCGTGGCGC CGCTGGTCGG TGCCCTGGTG 
GTCGCCGTCC TGCCTCGCCG TCGGCCGGAA CTGGCCAAGC AGGTGGCGCT CGGCTGGTCG
CTGCTGGTGC TGGCGCTGTC GGTGGTCATG TGGGTGACCT GGCAGACCGG TGGCGAGCGG
TTCCAGTTCC GCGAGTCCTA TCCGTGGATT CCGAACTGGG GCGTCAACTT CACCTTCGCC
GCGGACGGCA TCGCGCTGGT CATGCTGATG CTGATCGCGG TGCTGGTGCC ACTGGTGATC
CTGGCCTCCT GGCACGACGC CGAATCGTCG AAGCGATCGG TACCGGTCTA CTTCGCACTG
TTGCTGGTTC TCGAGTGCAC GATGATCGGC GTGTTCGCCG CCGCCGACGT CTTCCTGTTC
TACGTGTTCT TCGAGGTCAT GCTCGTGCCG ATGTACTTCC TCATCGGTAG TTACGGCGGC
CACCAGCGGC AGTACGCGGC CGTGAAGTTC TTCCTCTACT CCCTGGTCGG CGGCCTGTTC
ATGCTCGCCG CGGTGATCGG CCTGTGGGTG GTCGGCGGAA AGACGTTCGA CTGGGTGGCG
TTGTCACAGG TCGACATCTC CACGGGCGCG GAACGTTGGC TGTTCCTCGG CTTCTTCGTC
GCCTTCGCGA TCAAGGCACC GTTCTTCCCG TTCCACACTT GGCTGCCGGA CGCCGGTGGC
GCTGCCCCGG CTGGGGCCGC GGCGTTGCTG GTCGGCGTGC TCGACAAGGT GGGAACGTTC
GGCATCCTGC GCTACTGCCT TCCGCTGTTC CCGGACGCGG CGAAGTGGTT CGCCCCGTGG
GCGCTGGCGT TGGGCCTGAT CGGCATCATC TACGCGGCGC TGCTTGCCGT CGGTCAGAAC
GACCTGAAGC GGCTGGTGTC GTACACCTCG ATCGCGCACT TCGGCTTCAT CGGCGTCGGT
ATCTTCGCGT TCACCAGCCA GGCAGCCACC GGTGCGGTGC TCTACATGGT CAACCACGGG
CTCGCCACCG GTCTGCTCTT CCTGGTGGTC GGGATGCTGG TCGCCCGTCG GGGCTCCGCG
CTGATCAGCG ACTTCGGCGG CGCCGGCAAA CTCGTGCCGC TGCTGGCGGG GGTGCTCTTC
TTCGCCGGTC TCGCCTCGCT GGCGCTGCCC GGCACCGCAC CGTTCATCTC CGAGTTCCTG
GTGCTGATCG GCACCTTCTC GGTGAACAAG CCGGTGGCCG TGATCGCCAC CCTCGGGATC
ATCCTGGCCG CCGCGTACGT GCTCTGGATG GTGCAGCGCA CCACTCAGGG CACGCTGAAC
CCGGCACTGA CCGAGGTCGA CGGCATGAAA CGCGACCTCA ACCTGCGCGA GAAGGTCGTG
GTGGCCCCTC TGGTGGCGTT GATCGTGCTG CTCGGCTTCT ACCCGAAGCC GGTCACAGAC
GTGATCAACC CTGCCGTCCA GGCCACCATG CAGGATATCG GCAAGACTGA CCCGGCCCCG
TCGGCCGGCA CCACACAGGA GGCGAGCCGG TGA
 
Protein sequence
MSDFPFLSVL TVAPLVGALV VAVLPRRRPE LAKQVALGWS LLVLALSVVM WVTWQTGGER 
FQFRESYPWI PNWGVNFTFA ADGIALVMLM LIAVLVPLVI LASWHDAESS KRSVPVYFAL
LLVLECTMIG VFAAADVFLF YVFFEVMLVP MYFLIGSYGG HQRQYAAVKF FLYSLVGGLF
MLAAVIGLWV VGGKTFDWVA LSQVDISTGA ERWLFLGFFV AFAIKAPFFP FHTWLPDAGG
AAPAGAAALL VGVLDKVGTF GILRYCLPLF PDAAKWFAPW ALALGLIGII YAALLAVGQN
DLKRLVSYTS IAHFGFIGVG IFAFTSQAAT GAVLYMVNHG LATGLLFLVV GMLVARRGSA
LISDFGGAGK LVPLLAGVLF FAGLASLALP GTAPFISEFL VLIGTFSVNK PVAVIATLGI
ILAAAYVLWM VQRTTQGTLN PALTEVDGMK RDLNLREKVV VAPLVALIVL LGFYPKPVTD
VINPAVQATM QDIGKTDPAP SAGTTQEASR
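
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the tool used to produce this record: it assumes the standard codon assignments (which translation table 11 shares with the standard code for elongation codons) and does not model table 11's alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base column spacing used above) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGTCCGACT TCCCGTTTCT TTCGGTGCTC ACCGTGGCGC CGCTGGTCGG TGCCCTGGTG"
print(translate(first_row))  # MSDFPFLSVLTVAPLVGALV

# Last row of the gene sequence above, ending in the TGA stop codon.
last_row = "TCGGCCGGCA CCACACAGGA GGCGAGCCGG TGA"
print(translate(last_row))   # SAGTTQEASR
```

The two printed strings match the first 20 and last 10 residues of the protein sequence listed above.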