Gene Saro_1843 details

Gene Information

Locus tag: Saro_1843
Symbol:
ID: 3918403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1943572
End bp: 1944699
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 65%
IMG OID: 640444585
Product: alanine dehydrogenase/PNT-like
Protein accession: YP_497117
Protein GI: 87199860
COG category: [C] Energy production and conversion
COG ID: [COG3288] NAD/NADP transhydrogenase alpha subunit
TIGRFAM ID: [TIGR00561] NAD(P) transhydrogenase, alpha subunit
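
The coordinates and lengths in this record are related by simple arithmetic, which the following Python sketch makes explicit. It assumes that the start and end positions are 1-based and inclusive and that the final codon of the gene is a stop codon not counted in the protein length; neither assumption is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1943572
end_bp = 1944699
gene_length_bp = 1128
protein_length_aa = 375

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print("span from coordinates:", span_bp)
print("length divisible by 3:", gene_length_bp % 3 == 0)
print("expected protein length:", expected_protein_length)
```

Under those assumptions, the span derived from the coordinates matches the listed gene length, and the codon count minus one stop codon matches the listed protein length.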


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.226414
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGGGG CAATGAAGAT CGCCGTCCTC AGGGAACGCG CGACGGGGGA GACCAGGGTT 
TCGGCAACGC CGGAAACCGT GAAGAAGTTC ATTGCCCTTG GGGCAACAGT TGCAATTGAA
GAAGGTGCAG GCATTACCGC CTCGATTTCC GACGAGGATT ATCGCGCGGT GGGCGCCGAA
GTCGTGTCCA GTCCGGCAAA CGGGGCGGAC ATCGTGCTTG GCGTCCAGGG CCCCGAGCCC
GAACTGCTGG CGGGCGTGAA GCCCGGTGCC TGGATCGTGG CCGGGCTTGA TCCATTCGTG
AAGCGCGCCC GCGTGGACGC TTATGCGGCC GCCGGCCTTG AAGCGCTGGC GATGGAGTTC
ATGCCGCGCA TTACACGTGC ACAGTCGATG GACATCCTGT CGTCGCAGTC GAACCTTGCC
GGCTACAAGG CCGTGCTGGT GGCCGCCAAC CTTTATGGTC GCGCGTTCCC GATGATGATG
ACGGCGGCGG GCACCGTCTC TGCCGCCAAG GCTTTTGTCA TGGGCGTCGG CGTTGCCGGC
CTCCAGGCCA TCGCCACCGC TCGCCGTCTC GGCGCGCAGG TTTCGGCGAC CGACGTCCGT
TCGGCAACGA AGGAGCAGAT CCAGTCGCTC GGTGCCAAGC CGATCTTCGT GGAAAGCGTT
GCGGGCATCG AAGGCGAGGG CGCCGGCGGC TATGCCACGG AAATGTCCGA GGAATACCAG
AAGGCCCAGG CCGAGCTGGT GAGCGCGCAT ATCGCCAAGC AGGACATCGT CATCACCACG
GCGCTGATCC CGGGCCGCGC CGCGCCGCGC CTGATTTCCG ATGCGCAGAT TGCCACGATG
AAGCCCGGTT CGGTCATCTT CGACCTTGCC GTGGCCCAGG GCGGCAACGT CGAGGGTTCG
GTGCCCGACC AGGTTGTCGA GAAGCACGGC GTGAAGATCG TCGGCTACTC GAACACGCCC
GCGCACCTGC CGGCCGACGC TTCGGCGCTG TTCAGCCGCA ACCTCTACAA CTTCCTCTCG
GCCTTCTGGG ACAAGGAACA GGGCAAGCCC GTTCTGGACG AGGAAATCGG CAACGCCATC
CGCCTGACGC AGGGCGGCAA GGTGGTCAAC GAACGTCTGC TCGGCTGA
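
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any calculation. The sketch below shows one possible way to strip the layout whitespace and compute the GC percentage, which could then be compared with the GC content field of this record. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage on the first line of the gene sequence as printed above.
first_line = "ATGGCGGGGG CAATGAAGAT CGCCGTCCTC AGGGAACGCG CGACGGGGGA GACCAGGGTT"
print(len(clean_sequence(first_line)), "bases")
print(round(gc_percent(first_line), 1), "% GC in the first line")
```

Applying `gc_percent` to the full sequence text should give a value comparable to the rounded percentage in the record.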
 
Protein sequence
MAGAMKIAVL RERATGETRV SATPETVKKF IALGATVAIE EGAGITASIS DEDYRAVGAE 
VVSSPANGAD IVLGVQGPEP ELLAGVKPGA WIVAGLDPFV KRARVDAYAA AGLEALAMEF
MPRITRAQSM DILSSQSNLA GYKAVLVAAN LYGRAFPMMM TAAGTVSAAK AFVMGVGVAG
LQAIATARRL GAQVSATDVR SATKEQIQSL GAKPIFVESV AGIEGEGAGG YATEMSEEYQ
KAQAELVSAH IAKQDIVITT ALIPGRAAPR LISDAQIATM KPGSVIFDLA VAQGGNVEGS
VPDQVVEKHG VKIVGYSNTP AHLPADASAL FSRNLYNFLS AFWDKEQGKP VLDEEIGNAI
RLTQGGKVVN ERLLG
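
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch in plain Python, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons), and it assumes the reading frame begins at the first base. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
# The amino acid assignments are those of the standard code, which
# table 11 shares; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence from its first base, stopping at a stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first 18 bases and the last 9 bases of the gene sequence above.
print(translate("ATGGCGGGGG CAATGAAG"))  # MAGAMK
print(translate("CTCGGCTGA"))            # LG
```

The two printed fragments correspond to the beginning (MAGAMK) and the end (LG) of the protein sequence listed in this record.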