Gene Noca_1115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1115 
Symbol 
ID4599368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1178931 
End bp1180451 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content66% 
IMG OID639775711 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_922318 
Protein GI119715353 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCAAA CGACGGAGTT CGAGCGCCTG AGCGGGACCT ACTTCCACGA GGGCAAGCAC 
CTCCCCAGCG GCTCCGCTGT CCATCGACCC GTGATCGATC CGGCGACCGG CATGACCCTC
GGCTCCTTCG CGATCGTCAC CCCAGAAGAG GTCGACGCGG TCGTCGAAGC AGCGAACCGG
GCACAGCAGG GGTGGTGGAA GGAAAGCGCC CTGCACCGTG CCGAGGTGCT CCACGACGTG
GCCCGCAGGT TGCGTGCCCT CAAGCCGGAA CTGGCCGAGA TCCTGACCCG GGAGACCGGC
AAGCCGTTCA AGGAGAGCGC CGACGAGGTC GACTGGTCGG TCAGCAACCT GGATTACTAC
GCCGAGCTCG GACGGCACTC CATCGGGTCG GTTCTCGGGC CATCGATCGC CGGACAGACG
CACTACACCC TCAAGGAGCC GATGGGCACC GTCGTCGTCA TCCTGCCGGC GAACTATCCA
CTCCTCCTCC TGATCTGGGA GGCCGCGGCC GCTCTCGCCG CCGGGAACGC CGTTGTCGTC
AAGCCCTCGG AGTGGGCATC TCTGACGACG CTGAAGCTGA TGGAGATCTT CGAGCCGCTC
CCGGCCGGCC TCGTGGGCTG TGTCACCGGA GGAGGCGGTG TAGGTGCTCG GCTGGTGGAG
CACCGCAACA CCCACCAGGT CTGTTTCACC GGGAGCGTTC CCACCGGCAA GGTCGTCGCC
GAAGCCTGTG GCCGCCGGTT CAAGCCCACA CTGATCGAGG CGTCCGGCAA CGACGCCTTC
ATCGTGATGC CCTCGGCCCC GCTCGAGGTC GCTGCGCGAG CAGCGACCTT CGCCGCCTTC
TTCAACTGCG GTCAGGTCTG TACCTCCGCG GAGCGGGTCT TCGTCCACGA AGACATCCAC
GACCAGTTCG TCGAGCTCTT CGTCGCCGAG GCCGCCAGGT TGCGCATCGG CAACGGCCTC
GACCAGGTCG ACATCGGCCC GATGGAGCAC GCCGGCGAAC GTGACCGGTT CGAGCAGGTC
GTCACCCGGG CCATCGAACA AGGCGCCAAG GTCGAGATCG GCGGCGGCAG ACCGTCCGAC
CTGCCGTCGG AACTCGACGG TGGCTTCTTC GTCGAACCGA CCATCATGAC CGGGGTCACC
CCCGATATGG ACGTCGTCAA CGGTGAGGTG TTCGGGCCCC TCGCACCCAT CGTCAAGGTG
TCGAGCCTCG ATGAGGCCAT CCGCCTGACC AACGACTCCG ACTTCGGGCT CGGGGCCACC
GTGTACACCA CCGACGCGGC CGAGATCCAC CGCGCTACGA ATGAGATCGT CTCCGGAATG
GTGTGGATCA ACGCGCCGAT CCTCGACAAC GACGCAGGGC CGTTCGGTGG CCGCAAGATG
TCGGGAATCG GTCGGCAGCT CGGGAGCGAA GGCTTGGACA CCTTCCGGCA CACCAAGCTC
GTCATGATCG ACCCATTCGC TTCCCAGCAT GACTTCTGGT GGTTCCCGTA CGCCTCGGAC
GAGGCATGGC CGAGCTCCTG A
 
Protein sequence
MDQTTEFERL SGTYFHEGKH LPSGSAVHRP VIDPATGMTL GSFAIVTPEE VDAVVEAANR 
AQQGWWKESA LHRAEVLHDV ARRLRALKPE LAEILTRETG KPFKESADEV DWSVSNLDYY
AELGRHSIGS VLGPSIAGQT HYTLKEPMGT VVVILPANYP LLLLIWEAAA ALAAGNAVVV
KPSEWASLTT LKLMEIFEPL PAGLVGCVTG GGGVGARLVE HRNTHQVCFT GSVPTGKVVA
EACGRRFKPT LIEASGNDAF IVMPSAPLEV AARAATFAAF FNCGQVCTSA ERVFVHEDIH
DQFVELFVAE AARLRIGNGL DQVDIGPMEH AGERDRFEQV VTRAIEQGAK VEIGGGRPSD
LPSELDGGFF VEPTIMTGVT PDMDVVNGEV FGPLAPIVKV SSLDEAIRLT NDSDFGLGAT
VYTTDAAEIH RATNEIVSGM VWINAPILDN DAGPFGGRKM SGIGRQLGSE GLDTFRHTKL
VMIDPFASQH DFWWFPYASD EAWPSS