Gene Sare_3457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3457 
Symbol 
ID5708206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3984538 
End bp3987288 
Gene Length2751 bp 
Protein Length916 aa 
Translation table11 
GC content73% 
IMG OID641272884 
ProductNADH dehydrogenase (quinone) 
Protein accessionYP_001538250 
Protein GI159038997 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit
[COG2111] Multisubunit Na+/H+ antiporter, MnhB subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0550495 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCTCG TGGCCGTCCT CGGCTGGCAG CTCCTTCTGG CCGTCGGAGC GCCGGCGCTG 
ACCCGGCCAC TGGGGCGCAA CGCCGGTTAC GTACTCGCCG CCGGCTACCT GGCCGGGGCG
GGCCTGCTGA CCACCGAGAG TCCAGTGCTC CTCGCCGACC AGGCGGTCAC CGTCTCCTGG
CCATGGCTGC CCTCACTGGG GGTCTACGCG GCGCTGCGGA TGGACGCGCT CAGTTTCGTC
TTCGCCCTGC TGGTGCTGGG TGTGGGTGCG TTCGTCATGG CGTACTGCCC GCGCTACCTG
AGCGCGGGCA GCCGGCACAC CCGGGTGTAC GTCACGATGA CCCTGTTCGC CGGGGCGATG
CTGGGCCTGG TGCTCGCCGA CGACCTGCTG GTGCTGTTCG TCTTCTGGGA GCTGACGAGC
ATCCTGTCGT TCCTCCTCAT CGGGCAGGAC GGCCGCACGA AGGTGCGCGG CCCGGCGATC
CAGGCCCTCA TCGTCACCAC GACCGGTGGC GTGGCCCTGC TGGTGGCGGT GGTCGTGCTC
AGCGCGAGCC TCGGCACCAG CGACCTCGAT CGGATCCTGG CCGACCCAGG CCGGCTGGCG
ACCGGGCCCG CCTGGGCGGC GGGTGCCCTG GTGATCCTCG CGGCCATCAC GAAGTCGGCG
CAGGTGCCGT TTCATTTCTG GCTGCCCGGC GCCATGGTCG CCCTCACCCC GGTCAGCGCC
TACCTGCACG CCGCCACGCT GGTGAAGGCG GGTATCTACC TGCTGATGCG TTTCTCGGCG
TTGTTCGGCG GACAGTGGCC GTGGGACCTC ACGTTGATCG GCCTCGGACT GCTGACCGCG
ATCGTCGGCG CGCTGTTGGC GCTTCGCCAA CACGACCTCA AGGCGCTGCT CGCGTACTCC
ACGGTGAGTC AGCTGGGCCT GCTGGTCGGC GTGATCGGGG TCGGCACGCC CGCCTCGGAC
GCGGCGGCGA TCCTCTACAC GATCGCGCAC GCCCTGTTCA AGGCGACCTT GTTCATGCTG
GTGGGAATCA TCGACCGGCA GGCGGGCAGC CGGGACATCC GGGCCCTGTC CGGGCTGCGT
CGGGCGATGC CGATCACCGT GATGCTCACC GTGCTGGCGG CCATGTCGCT CGCCGGGCTA
CCGCCGACGA TCGGGTTCGT CGGCAAGGAG GCGATCTTCG ATGCGCTCAG TGGCGTACAC
GGGGTGCCCT GGCTCGGGTG GGTGGCGGCC GGCCTGGCGG TACTCGCCTC GACCCTCACC
TTCGCGTACG CGGCGCGGTT GGTCTACGGC GTCCTGTCCG GCCCGATCCG GCAGCGCGAG
TTGCACGAAC CGGCCTGGTC GTTCCTCGCC CCGGCGGCCG TCGCGGCGGT CATCGCCACC
GCCCTCGGTC CGACCGTCGC CGTCCTCAGC CCGATGGTCG AACGGGCCGC CAGCGACGCC
CGCCCTCAGG GGCAGGCTCC CTACCTGGCC TTCTGGCACG GCTTCACGCC GGCGCTCGGG
CTGTCCGTGC TCACCGTCCT GCTCGGGACG CTGCTGTTCC TGTGCCGAGG CCGGACCGAC
TCGATGCTTG CGGCGATCCC GACCACGTCG CCGTTCACCG GCTACCTGGA GCGGTTCCGG
CGTACGCTGC TGCGCCTCGG TGCGGTCGTG GCCCGGCCCG CCCGGGTCAC CGCTCCGGCT
CCCTATCTGA CCCGGCCGCT ACTCGCCGTG GTGGCGCTGG CGGCGGTCGC CGTACTGCTG
GTCGGCCCGC TGCCCGCCGC GCAGGCGGAC CCCACCCGGG GTGGGGACTG GCTCGTGCTC
GGGCTGCTGC TGGTCCCCCT CGCCGGACTG GTCAGCACCC GCTCTGCGCT CGCGGCGGTC
GCGCTGACCG GCGCGGTCGG CCTGATTCTG GCAGCCTGGT TCCTCACGGT GGGTGCGCCG
GACGTGGCGC TCACCCTGAT GCTGGTCGAG GTGCTCACCA CAGTGGTGGT CATGCTCGCC
CTGCGGGGCC GGCCCGGCCG GCTGGTCGCG CCGGGCCGGC GGGCGGGGAT CGTCGCCGGT
GCCGGGCTGG CGGTGCTCGT CGGGGCGGCG GCGACCGCGG CGACGGCGGC ACTGACCGGC
CGCCGCGACC TGTCGCCGGC CGGCGACTGG TACCTGCGGG AGGCATCGGC GACCACCGGC
GGGGAGAACC TGGTCAACAC CATCCTCGTG GACTTCCGGG CCCTGGACAC CCTCGGCGAG
GCGGTGGTGC TCGGGGTGGT GGCGGTCGGA CTGGTCGGGC TCGCCCGGCC AGACGAAGCC
GAACGCCCCC GTCCGGCTGC CGTACGGCAC GTCGTCGATC CGGTGCTCGA ACTGGCGTAC
CGCGTACTGG CCCCGGTCAT GCTCGCCGCG TCCGCGCTGT TGTTCCTCCG CGGCCACGAG
GAACTCGGTG GTGGGTTCAT CGCCGCGCTG CTCGCCGGTA CCGCGGTCGG ACTCGGTCAC
CTGGCCCATC CCGCGGGAGC GCCGCTGCTC GGCCGCCTCC GAGCGGTGCC GCTGCTGACG
GTCGGGCTGC TGCTGGCGCT CGCCGCGGGT CTGGCGCCGC TCGCCGCGGG CCGGGTCTTT
CTCACTCCGA TCAAGTTTCC CCTGCCTGGG GTCGGTTCGG TGACGTCGGC TCTCGTGTTC
GACCTGGGTG TGTACCTGAT GGTTCTGGCC CTGGTCGTGG CGGCTGTTCG GCGGTTGGAC
AGCGCGCCGG CCGGGCCCGC GCCCGGTCCG GTCCGGGAGC GGGTCCGATG A
 
Protein sequence
MILVAVLGWQ LLLAVGAPAL TRPLGRNAGY VLAAGYLAGA GLLTTESPVL LADQAVTVSW 
PWLPSLGVYA ALRMDALSFV FALLVLGVGA FVMAYCPRYL SAGSRHTRVY VTMTLFAGAM
LGLVLADDLL VLFVFWELTS ILSFLLIGQD GRTKVRGPAI QALIVTTTGG VALLVAVVVL
SASLGTSDLD RILADPGRLA TGPAWAAGAL VILAAITKSA QVPFHFWLPG AMVALTPVSA
YLHAATLVKA GIYLLMRFSA LFGGQWPWDL TLIGLGLLTA IVGALLALRQ HDLKALLAYS
TVSQLGLLVG VIGVGTPASD AAAILYTIAH ALFKATLFML VGIIDRQAGS RDIRALSGLR
RAMPITVMLT VLAAMSLAGL PPTIGFVGKE AIFDALSGVH GVPWLGWVAA GLAVLASTLT
FAYAARLVYG VLSGPIRQRE LHEPAWSFLA PAAVAAVIAT ALGPTVAVLS PMVERAASDA
RPQGQAPYLA FWHGFTPALG LSVLTVLLGT LLFLCRGRTD SMLAAIPTTS PFTGYLERFR
RTLLRLGAVV ARPARVTAPA PYLTRPLLAV VALAAVAVLL VGPLPAAQAD PTRGGDWLVL
GLLLVPLAGL VSTRSALAAV ALTGAVGLIL AAWFLTVGAP DVALTLMLVE VLTTVVVMLA
LRGRPGRLVA PGRRAGIVAG AGLAVLVGAA ATAATAALTG RRDLSPAGDW YLREASATTG
GENLVNTILV DFRALDTLGE AVVLGVVAVG LVGLARPDEA ERPRPAAVRH VVDPVLELAY
RVLAPVMLAA SALLFLRGHE ELGGGFIAAL LAGTAVGLGH LAHPAGAPLL GRLRAVPLLT
VGLLLALAAG LAPLAAGRVF LTPIKFPLPG VGSVTSALVF DLGVYLMVLA LVVAAVRRLD
SAPAGPAPGP VRERVR