Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_2160 |
Symbol | |
ID | 4599220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 2309634 |
End bp | 2311124 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639776763 |
Product | betaine-aldehyde dehydrogenase |
Protein accession | YP_923356 |
Protein GI | 119716391 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTCG GCATCACCGC GCCGACAGAC GCGATCTCGA TGGACATGCA GATGCTCATC GGAGGCAACT GGGTGGACGC GCTGGGCGAG GAACGCATCC CCGTCGAGAG CCCCAGCACG ATGACCACGA TCGGATCCGT TCCGCGCGCA CGCTCCGTCG ACATTGATCG TGCAGTGGTC GCGGCGCGCC AGTCGTTCCC GGCCTGGCGT GATACACCGC CACGGCAACG GGGGCGACTG CTCGCACGAA TCGCTGACGC CCTGGAACCG TTGGCCGAAG AACTTGCTCG AACAATCTCC ACCGAGAACG GCAACGCGAT TCGGACACAG TCTCGCGGAG AGGTCGCGTT CTCCGTCGAC GTATTCCGGT ACTTCGGGGG GATTGCAAGC GAGGCCAAGG GAGAGACCAT TCCGCTGGGA AGCACAGTCC TCGACTACTC CCGTCGTGAG CCTTTCGGCG TCGTCGGTGC GATCGTTCCC TGGAATGCGC CCTTACAGCT CAGCGCCATG AAGATCGCTC CGGCTTTGGC AATGGGAAAC ACCATCGTCC TCAAGGTTGC CGAAGATGCT CCGCTGGCGG TGCTCCGGCT GGCCGAGGTT GCCAACCAAG TCCTGCCGGC GGGCGTCCTC AACGTCATCC CAGGGTATGG CGACGAAGCA GGCGAAGCGC TCATTCGCCA TGCCGACGTC GATAAGTTGA CCTTCACGGG CTCGACTGCG ATCGGCAGTC ACGTCATGGC GACGGCCGCG GAAAGAATCG TTCCAGTCTC GCTGGAGCTC GGAGGGAAGA ACCCACAGAT AGTCTTTCCG GACGCGGACA ACGACGAAGT AGCACGTGGC GCCATCATGG CAATGCGGTT CGCTCGCCAA GGCCAGTCGT GTACTGCAGG GTCGCGTCTA TTCGTGCACT CCTCAATCTT CGATTCCTAC CTCGATCGAT TCGTTGGCGC GCTACGCGAA CTCAGGGTCG GTGACCCATT GGACGAAGCC TCCGACATCG GCGCCATCGT CAATAGGAAG CAGTTCGACA AAGTCTGCGG CTACATCTCG GAGGGCATCG AATCGAACTC GACCGTGCTG CTCGGTGGAC TCCCGCCCTC CGACGGGCCA CTAGCGAACG GCTACTACGT CACGCCCACA GTGCTGTCGC AGGTCGATCC TGCGTGGCGC CTGGCTCGCG AGGAGATCTT CGGGCCCGTC GTGTGTGCCA TCCCGTGGAC CGATGAGGAG GAGGTCTTGG AACTCGCCAA CCGGTCCCAC TATGGGCTGA GCGCGTTTAT TTGGACCTCT AACCTCGGAG CTGCCTTGCG AGCGGCACAT GCGGTCGAGA GCGGATGGGT TCAGGTGAAT CAAGGCGGCG GTCAAGTACT TGGCCAGTCC TACGGAGGGT TTAGGCGGAG CGGCATCGGG CGCGAGTTCT CACTCGAAGG AATGCTCGAC AGCTACACCC ATCGCAAGCA CGTCTCGATC AATCTCGCTC CCATCGGATA G
|
Protein sequence | MSLGITAPTD AISMDMQMLI GGNWVDALGE ERIPVESPST MTTIGSVPRA RSVDIDRAVV AARQSFPAWR DTPPRQRGRL LARIADALEP LAEELARTIS TENGNAIRTQ SRGEVAFSVD VFRYFGGIAS EAKGETIPLG STVLDYSRRE PFGVVGAIVP WNAPLQLSAM KIAPALAMGN TIVLKVAEDA PLAVLRLAEV ANQVLPAGVL NVIPGYGDEA GEALIRHADV DKLTFTGSTA IGSHVMATAA ERIVPVSLEL GGKNPQIVFP DADNDEVARG AIMAMRFARQ GQSCTAGSRL FVHSSIFDSY LDRFVGALRE LRVGDPLDEA SDIGAIVNRK QFDKVCGYIS EGIESNSTVL LGGLPPSDGP LANGYYVTPT VLSQVDPAWR LAREEIFGPV VCAIPWTDEE EVLELANRSH YGLSAFIWTS NLGAALRAAH AVESGWVQVN QGGGQVLGQS YGGFRRSGIG REFSLEGMLD SYTHRKHVSI NLAPIG
|
| |