Gene Saro_2294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2294 
Symbol 
ID3916612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2435747 
End bp2437756 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content65% 
IMG OID640445050 
ProductNADH dehydrogenase subunit G 
Protein accessionYP_497565 
Protein GI87200308 
COG category[C] Energy production and conversion 
COG ID[COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) 
TIGRFAM ID[TIGR01973] NADH-quinone oxidoreductase, chain G 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAAGG TTAAGGTAGA CGGCGTAGAA CTCGAAGTTC CGGCAGGCGC CACCGTCCTG 
CAGGCGTGCG AGCTGGCCGG CAAGGAAATC CCGCGCTTCT GCTATCACGA ACGGCTGAGC
ATTGCCGGCA ATTGCCGCAT GTGCCTGGTC GAAGTGAAGC CCGGACCGCC GAAGCCGCAG
GCTTCCTGCG CGCTCCCGGC GACCGAGGGC CAGGAAATCC GCACGGACTC CGAGATGGTC
AAGAAGGCGC GCGAAGGGGT GATGGAGTTC CTCCTCATCA ACCACCCGCT CGACTGCCCG
ATCTGCGACC AGGGCGGTGA ATGCGATCTG CAAGACCAGT CGGTCGCCTA TGGCCGCGGT
GCCTCGCGCT ATCACGAAAA CAAGCGCGCG GTGACCGAGA AGTACATGGG CCCGCTCATC
AAGACGACGA TGACGCGCTG CATTCATTGC ACCCGCTGCG TGCGTTTCTC GGAAGAGGTC
GCGGGCGTGG ACGAGATCGG TGCGCTTTAT CGCGGCGAGA CGATGCAGAT TTCGACCTAT
CTCGAGAAGG CTGCGGCGCA CGAACTTTCG GCCAACGTGA TCGACCTCTG CCCGGTCGGT
GCGCTGACTT CGCGCCCCTA TGCGTTCGAA GCGCGTCCGT GGGAGCTGAA GAAGACGCTG
TCGATCGACG TTTCGGATGC TCTGGGTTCG AACATCCGGC TCGACAGCCG TGGCCGCGAG
GTGCTGCGCA TCCTGCCGCG CGTGAACGAC GACGTGAACG AGGAATGGCT GTCCGACCGT
GGCCGCTACA TGGTCGACGG GCTGACCCGC CGCCGCCTCG ACAAGCCTTG GCTGCGCCGT
GACGGCAAGC TGGTCGCAGC GACCTGGGCC GAAGCGTTCG AAGCCGTTGC GAAGGTCAAC
CCGGGTTCGT CTGTCGCGGT CATCGCTGGC GATCTGGTCG ATTGCGAGAC GATGTTCGCG
GCAAAGAAGC TGGCCGGCGC ACTGGGATCG TCCCTGCTCG AAGGCCGCCA GACCGGTTTG
GCTTACGACA CGTCGAACCT CACCGCTGTG AACTTCAACT CGACGCTGGC TGGCATCGAG
GACGCGGACG CCGTTCTGAT CGTCGGTTCG ATGATTCGTG ACGAGGCTCC TCTGCTCAAC
ACCCGCCTGC GCAAGGCGGC GAAGAAGGGC GCGAAGGTGT TCATCGTCGG CCCGCACTGG
GACCCGACCT ATCCGGCGAC GTTTCTGGGC GACGATCTGG CAGTGCTTGG AAACCTGCCG
GCCGAAGTCA GCGATGCGTT CGGTGCGGCA CAGAAGCCGG CGATCATTGT CGGCGGCGCG
GCGCTGGGCA AGGGTGCGCT GGCCGCGGGC TTGGCCTTCG CCGAAAAGTT CAACCTCGTC
CGTGAGGGCT GGAACGGCTT CAACGTCGTC CACATGGCGG CGAGCCGCAT GGGTGGCCTG
ATGCTCGGCT ATGCGCAGAA GGGTGGCATT GCCGACCTCG TTGCGGCCAA GCCGAAGATG
GTGATCTCGC TCGGTGCCGA CGAAGTGGAC TTCACCAGGT TCGCGGGCAG CATGATCGTC
CACATCGGCC ATCATGGTGA CAAGGCGGCG CACGCCGCCG ACGTGATCCT GCCGGCCGCC
GCGTTCAGCG AGAAGGACGG CACCTACGTC AACACCGAAG GCCGCGTGCA GTATGCGGAG
AAAGCCGTGT TCGCGCCGGG CGATGCCCGC GAGGACTGGA CGATCTTGCG CGCCATGGCC
GATGCGCTGG GAGTTTCGGT CGGCTTCGAC AGCTTCGAGC AGCTTCGCGC CGCCATGGTT
GCCGAAGTTC CGGCACTGGG GTTGGAAGGT CTGGCCGATT ACGGTGCGCT GCCTGCCGCG
TCTGCCGACG TGAAGGCCGA GGGCGTGATC GCGGGCTATC CGATCAAGGA CCGCTACCTG
ACCAACGCCA TCGCCCGCTC CAGCCCGACG CTGCAGCGCT GCTCGGCGGA ACTGCTCCAC
GGTGAAAGCT TCGCGGAGGC CGCGGAATGA
 
Protein sequence
MPKVKVDGVE LEVPAGATVL QACELAGKEI PRFCYHERLS IAGNCRMCLV EVKPGPPKPQ 
ASCALPATEG QEIRTDSEMV KKAREGVMEF LLINHPLDCP ICDQGGECDL QDQSVAYGRG
ASRYHENKRA VTEKYMGPLI KTTMTRCIHC TRCVRFSEEV AGVDEIGALY RGETMQISTY
LEKAAAHELS ANVIDLCPVG ALTSRPYAFE ARPWELKKTL SIDVSDALGS NIRLDSRGRE
VLRILPRVND DVNEEWLSDR GRYMVDGLTR RRLDKPWLRR DGKLVAATWA EAFEAVAKVN
PGSSVAVIAG DLVDCETMFA AKKLAGALGS SLLEGRQTGL AYDTSNLTAV NFNSTLAGIE
DADAVLIVGS MIRDEAPLLN TRLRKAAKKG AKVFIVGPHW DPTYPATFLG DDLAVLGNLP
AEVSDAFGAA QKPAIIVGGA ALGKGALAAG LAFAEKFNLV REGWNGFNVV HMAASRMGGL
MLGYAQKGGI ADLVAAKPKM VISLGADEVD FTRFAGSMIV HIGHHGDKAA HAADVILPAA
AFSEKDGTYV NTEGRVQYAE KAVFAPGDAR EDWTILRAMA DALGVSVGFD SFEQLRAAMV
AEVPALGLEG LADYGALPAA SADVKAEGVI AGYPIKDRYL TNAIARSSPT LQRCSAELLH
GESFAEAAE