Gene Noca_4199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4199 
Symbol 
ID4596713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4437625 
End bp4439193 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content70% 
IMG OID639778805 
Productaldehyde dehydrogenase 
Protein accessionYP_925383 
Protein GI119718418 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCCCCA CAGGTGCCTC GCTCATCGCA GGCAGGTCGG TCGTCGGCAC TGCTGGCAGC 
ACGCGTGCCC ACAACCCCGC CACGGGCGAG GCTCTCGGCC CCGAGTTCGG TTACGCGGGC
CCCGAGGACC TCGCCGCCGC GACCCGCGCC GCGACTGAAG CCTTCGAGCC CTACCGGGCT
ACCTCCCCGA GCGAGCGGGC TGCGTTCCTC GACCTCATCG CGGACAACCT CGACGCGGCG
AGGGACGCCA TCGTCGCCCG CGCCGTCCTG GAGTCCGGTC TGACGGAAGC GCGGCTGTTC
GGCGAGCACG CCCGTACTGT CAACCAGCTC CGCCTGTTCG CACGCGAGGT CCGCCTGGGC
GAACACCACG GCGTACGCAT CGACGAGGCG CAACCAGACC GCCAGCCGAT CCCAGCGCCG
GATATCCGTC AGCGCCAGAT CTCCATCGGC CCGGTCCTGG TATTCGGCGC GAGCAACTTC
CCGCTGGCGT TCTCCACCGC CGGAGGTGAC ACGGCATCGG CGTTGGCCGC CGGCTGCCCC
GTGATCGTGA AGGCGCACAA CTCCCACGCG GGCACCGCCG AACTCGCCGG CCGCGCGATC
TCCGATGCGG TCGCCCAGTC GGGGTTGCCC GCCGGCGTCT TCTCGATCAT CTTCGGCGCA
GGCAGCGCCG TCGGGCAAGC CCTCGCCCAG GACCCGGCCA TCAAGGCCAT CGCGTTCACC
GGCTCACAGG CCGCCGGCAC CGCACTGATG GCCACGGCCG CGGCTCGTCC GGAGCCCATT
CCGGTGTACG CCGAGATGTC GAGCATCAAT CCCGTGATCC TTCTGCCGGG TGCGGTCGCC
GAGTGTGCCG AGGCGCTCGC CACGGGCTTC GTCGGATCGC TGACGCTGGG TGCCGGCCAG
TTCTGCACCA ATCCCGGGCT CATCTTCGTC CCCGCCGGCC AGGCAAGGTT CGTCGAGGCT
GTCGGCGAGC TCCTCCGGGA ATCCGTCGGC CAGACGATGC TCTCGGCGAA TATCGCCGCC
GCCTATACGG AGGGTCTCGA ACGACTAGCT GACGCCGGGG TCACCCAGGT TGCGACCGGT
GCTGAGGGGG CGACGCTCAA CGCACCCGCC CCGGCGCTCT TCACGACCAC TGCTGCGCAC
TTCCGCGATT CACCGGACAT GCAGGAGGAG GTCTTCGGCG CCGCAGCCCT CGTCGTCACC
TACGACGACC AGGCCGAGCT ACGCGAGACG CTGCGGGAGA TGCAGGGACA GCTGACCGCG
ACCATTCATG CGGCGATCGG TGATCAGGCC CTCGCGGCCG ACCTGCTGCC CGTGCTCGAG
ACCATGGCCG GGCGCATCCT CTTCAACGGG TGGCCGACCG GGGTCGAGGT CACCCACGCC
ATGGTGCACG GTGGCCCGTT CCCCGCTACC AGTAATGCGA TGACGACCTC GGTCGGCACG
CTCGCGATCC AACGCTTTCT CCGGCCGGTC AGCTACCAGA ACCTGCCGGC GTCGCTACTT
CCCGAGCCAC TGCGGGTGGA TAACCCCTGG CACCTGCCCC GTCGCCTGAA TGGGATGCCG
CAGACGTGA
 
Protein sequence
MTPTGASLIA GRSVVGTAGS TRAHNPATGE ALGPEFGYAG PEDLAAATRA ATEAFEPYRA 
TSPSERAAFL DLIADNLDAA RDAIVARAVL ESGLTEARLF GEHARTVNQL RLFAREVRLG
EHHGVRIDEA QPDRQPIPAP DIRQRQISIG PVLVFGASNF PLAFSTAGGD TASALAAGCP
VIVKAHNSHA GTAELAGRAI SDAVAQSGLP AGVFSIIFGA GSAVGQALAQ DPAIKAIAFT
GSQAAGTALM ATAAARPEPI PVYAEMSSIN PVILLPGAVA ECAEALATGF VGSLTLGAGQ
FCTNPGLIFV PAGQARFVEA VGELLRESVG QTMLSANIAA AYTEGLERLA DAGVTQVATG
AEGATLNAPA PALFTTTAAH FRDSPDMQEE VFGAAALVVT YDDQAELRET LREMQGQLTA
TIHAAIGDQA LAADLLPVLE TMAGRILFNG WPTGVEVTHA MVHGGPFPAT SNAMTTSVGT
LAIQRFLRPV SYQNLPASLL PEPLRVDNPW HLPRRLNGMP QT