Gene Noca_2160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2160 
Symbol 
ID4599220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2309634 
End bp2311124 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content62% 
IMG OID639776763 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_923356 
Protein GI119716391 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCTCG GCATCACCGC GCCGACAGAC GCGATCTCGA TGGACATGCA GATGCTCATC 
GGAGGCAACT GGGTGGACGC GCTGGGCGAG GAACGCATCC CCGTCGAGAG CCCCAGCACG
ATGACCACGA TCGGATCCGT TCCGCGCGCA CGCTCCGTCG ACATTGATCG TGCAGTGGTC
GCGGCGCGCC AGTCGTTCCC GGCCTGGCGT GATACACCGC CACGGCAACG GGGGCGACTG
CTCGCACGAA TCGCTGACGC CCTGGAACCG TTGGCCGAAG AACTTGCTCG AACAATCTCC
ACCGAGAACG GCAACGCGAT TCGGACACAG TCTCGCGGAG AGGTCGCGTT CTCCGTCGAC
GTATTCCGGT ACTTCGGGGG GATTGCAAGC GAGGCCAAGG GAGAGACCAT TCCGCTGGGA
AGCACAGTCC TCGACTACTC CCGTCGTGAG CCTTTCGGCG TCGTCGGTGC GATCGTTCCC
TGGAATGCGC CCTTACAGCT CAGCGCCATG AAGATCGCTC CGGCTTTGGC AATGGGAAAC
ACCATCGTCC TCAAGGTTGC CGAAGATGCT CCGCTGGCGG TGCTCCGGCT GGCCGAGGTT
GCCAACCAAG TCCTGCCGGC GGGCGTCCTC AACGTCATCC CAGGGTATGG CGACGAAGCA
GGCGAAGCGC TCATTCGCCA TGCCGACGTC GATAAGTTGA CCTTCACGGG CTCGACTGCG
ATCGGCAGTC ACGTCATGGC GACGGCCGCG GAAAGAATCG TTCCAGTCTC GCTGGAGCTC
GGAGGGAAGA ACCCACAGAT AGTCTTTCCG GACGCGGACA ACGACGAAGT AGCACGTGGC
GCCATCATGG CAATGCGGTT CGCTCGCCAA GGCCAGTCGT GTACTGCAGG GTCGCGTCTA
TTCGTGCACT CCTCAATCTT CGATTCCTAC CTCGATCGAT TCGTTGGCGC GCTACGCGAA
CTCAGGGTCG GTGACCCATT GGACGAAGCC TCCGACATCG GCGCCATCGT CAATAGGAAG
CAGTTCGACA AAGTCTGCGG CTACATCTCG GAGGGCATCG AATCGAACTC GACCGTGCTG
CTCGGTGGAC TCCCGCCCTC CGACGGGCCA CTAGCGAACG GCTACTACGT CACGCCCACA
GTGCTGTCGC AGGTCGATCC TGCGTGGCGC CTGGCTCGCG AGGAGATCTT CGGGCCCGTC
GTGTGTGCCA TCCCGTGGAC CGATGAGGAG GAGGTCTTGG AACTCGCCAA CCGGTCCCAC
TATGGGCTGA GCGCGTTTAT TTGGACCTCT AACCTCGGAG CTGCCTTGCG AGCGGCACAT
GCGGTCGAGA GCGGATGGGT TCAGGTGAAT CAAGGCGGCG GTCAAGTACT TGGCCAGTCC
TACGGAGGGT TTAGGCGGAG CGGCATCGGG CGCGAGTTCT CACTCGAAGG AATGCTCGAC
AGCTACACCC ATCGCAAGCA CGTCTCGATC AATCTCGCTC CCATCGGATA G
 
Protein sequence
MSLGITAPTD AISMDMQMLI GGNWVDALGE ERIPVESPST MTTIGSVPRA RSVDIDRAVV 
AARQSFPAWR DTPPRQRGRL LARIADALEP LAEELARTIS TENGNAIRTQ SRGEVAFSVD
VFRYFGGIAS EAKGETIPLG STVLDYSRRE PFGVVGAIVP WNAPLQLSAM KIAPALAMGN
TIVLKVAEDA PLAVLRLAEV ANQVLPAGVL NVIPGYGDEA GEALIRHADV DKLTFTGSTA
IGSHVMATAA ERIVPVSLEL GGKNPQIVFP DADNDEVARG AIMAMRFARQ GQSCTAGSRL
FVHSSIFDSY LDRFVGALRE LRVGDPLDEA SDIGAIVNRK QFDKVCGYIS EGIESNSTVL
LGGLPPSDGP LANGYYVTPT VLSQVDPAWR LAREEIFGPV VCAIPWTDEE EVLELANRSH
YGLSAFIWTS NLGAALRAAH AVESGWVQVN QGGGQVLGQS YGGFRRSGIG REFSLEGMLD
SYTHRKHVSI NLAPIG