Gene Noca_4473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4473 
Symbol 
ID4596992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4728175 
End bp4729635 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content74% 
IMG OID639779084 
Productsuccinate semialdehyde dehydrogenase 
Protein accessionYP_925657 
Protein GI119718692 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.623002 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGACT TCACCCTCGC GCAGATCCCC GACCTGCCGC TCGACCTCTG CGTCGGCGGG 
AAGGAGGTCC CGGCGTCCGA CGGGGGCCGC TTCGACGTGC TCGACCCGGC CACCGGTGCC
GTCCTCACCT CGGTCGCCAA CGGCACGGTC GAGGACGCCC TCGCCTGCGT CGACGCGGCC
GACGCCGCCG CGGCCGCCTG GGCCGCGACC GCGCCGCGGG AGCGCTCGGA GATCCTGCGC
AAGGCCTTCG AGCTGATGCG CGAGCGCGCC GACGAGCTCG CGCACCTGAT CTCCCTGGAG
AACGGCAAGG CGCTGGCCGA CGCCCGCGGC GAGGTGGCCT ACGCCGCCGA GTTCTTCCGC
TGGTACGCCG AGGAGGCGGT CCGCGCGGCC GGCTCCGTGA TGACCGCGCC GTCCGGGGCC
AACCGGATCG TCGTGCTCCA GCAGCCGGTC GGCATCTGCG TGCTGGTCAC GCCGTGGAAC
TTCCCCGCCG CGATGGCCAC CCGCAAGATC GGCCCGGCGC TGGCGGCCGG CTGCACCGTC
GTGCTCAAGC CGGCCAGCGA CACCCCGCTC ACCGCGCTGC TGATGGCCAA GATCCTCGCC
GACGCCGGCG TCCCCGCGGG CGTGGTCAAC GTGCTGCCCG CGCGCCGCTC GGGCGCCGTG
GTGTCCGCGA TGCTGCACGA CCCGCGGGTC CGCAAGCTCT CCTTCACCGG CTCGACCGAG
GTCGGCCGGG TGCTGCTGCG CGAGGCCGCC GACCAGGTCG TCAACTGCTC GATGGAGCTC
GGCGGCAACG CGCCGTTCAT CGTCCTCGAC GACGCCGACC TGGATGCCGC CGTCGACGGC
GCGATGATCG CGAAGATGCG CAACGCCGGC GAGGCCTGCA CCGCCGCGAA CCGCTTCTAT
GTCCACGCCG ACGTGGCCGA CGAGTTCAGC CGCCGGCTCG CCGAGCGGAT GGCCGCGCTG
CGGGTCGGCC CCGGCACGGC CGACGACACC GAGGTCGGCC CGCTGGTCAA CGACGAGTCG
GCCGCCAAGG TCGACGAGCT GGTCCGGGGC GCGGTCTCGG CCGGCGCGCG GGTCGTGGTC
GGCGGTCGCC GGCCGGAGCG CGAGGGCTAC TACTACGAGC CGACCGTGCT GCTCGACGTG
CCCGTCGACG CGGAGATCCT GGGCGAGGAG ATCTTCGGAC CGGTCGCCCC GGTGGTGACG
TTCACCGACG AGGACGACGC GATCCGGATG GCGAACGAGA CCGAGTACGG CCTGGTGTCC
TACGTCTACA CGCGCGACCT GGCGCGGGGG ATGCGGGTCA GCGAGCGGCT CGACTCCGGC
ATGGTCGGCC TCAACCGCGG GCTGGTCTCC GACCCGGCCG CGCCGTTCGG CGGCACCAAG
CAGTCCGGCG TCGGCCGCGA GGGCGGCCAC GAAGGCATGC TCGACTACCT GGAGTCGAAG
TACGTCGCGG TGTCCTGGTG A
 
Protein sequence
MADFTLAQIP DLPLDLCVGG KEVPASDGGR FDVLDPATGA VLTSVANGTV EDALACVDAA 
DAAAAAWAAT APRERSEILR KAFELMRERA DELAHLISLE NGKALADARG EVAYAAEFFR
WYAEEAVRAA GSVMTAPSGA NRIVVLQQPV GICVLVTPWN FPAAMATRKI GPALAAGCTV
VLKPASDTPL TALLMAKILA DAGVPAGVVN VLPARRSGAV VSAMLHDPRV RKLSFTGSTE
VGRVLLREAA DQVVNCSMEL GGNAPFIVLD DADLDAAVDG AMIAKMRNAG EACTAANRFY
VHADVADEFS RRLAERMAAL RVGPGTADDT EVGPLVNDES AAKVDELVRG AVSAGARVVV
GGRRPEREGY YYEPTVLLDV PVDAEILGEE IFGPVAPVVT FTDEDDAIRM ANETEYGLVS
YVYTRDLARG MRVSERLDSG MVGLNRGLVS DPAAPFGGTK QSGVGREGGH EGMLDYLESK
YVAVSW