Gene Noca_1116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1116 
Symbol 
ID4599369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1180451 
End bp1181899 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content69% 
IMG OID639775712 
Productsuccinate semialdehyde dehydrogenase 
Protein accessionYP_922319 
Protein GI119715354 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.103015 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCTGGG CTGCTGGGCG GGAGGTCAAC GCGCGTGACC GGACCGGTAC CAACCTCGTG 
AACGGCAATC GCACGTTCGC AGTGGAGGAC CCCGCCACTC TCGAGCTCAT CGCAGAGGTG
GCCGACCACG GCGAGACCGA GGCGCGGGCC GCGGTCGATC GTGCGCACGA GGCCTTCGCG
GGGTGGTCAG GCACCTCTCC GCGGCAGCGT TCCGATGTGC TCCGACGCGC CTATGAGCTG
ATGCTGCGCG ACGAGGGACG GCTGAGCGCT CTGATAGCTC GCGAGAACGG CAAGTCGCTG
GCCGACGCAG CGTCCGAGGT TGTCTATGCC GCCGAGTTCT TCCGGTGGTA CGCCGAGGAG
GTGGTGCGCA CTGAGGGCTC GTACGGCGAG GCGCCGGCCG GCGGTGTACG GACCATCGTG
CACCACAAGC CCGTCGGGGT GGCGGCCCTG GTGATTCCGT GGAACTTCCC CGCCGCGATG
GCCACCCGCA AGATCGCCCC AGCCCTTGCC GCGGGATGCA CCGTTGTTCT CAAGCCGGCC
GCCGAGACCC CGCTGACCGC GATCGCGATC GCCGACTTGC TCGCCGAGGC CGGCCTACCG
GCCGGCGTCG TCGAGTTGGT CACCACCACC CACGCCGGCG CCGTCGTGAC CGCGTGGCTC
GAGGACGACC GGGTGCGCAA GGTGTCCTTC ACCGGCTCCA CCGGCATCGG CCGGTTGCTG
CTGCATCAAG CCGCCGACCG TGTCGTCAAC ACCTCCATGG AGCTCGGCGG CAACGCCGCT
TTCATCGTCA CCGAGGATGC CGACCTCGTC GACGCCGTCG CCGGGGCGAT GATCGCCAAG
TTCCGCAACG GCGGGCAGGC GTGCACCGCT GCGAACCGGA TCTACGTGCA CCGTGACGTC
GCAGACGCCT TCGTCGCGCT GGTCGGTGCC CAGGTCGAGA AGCTGAGCGT CGGTGCGGCG
AATGACGGCA ACATGATCGG CCCACTCATC AGCGCCGCGG CGGTGAGCCG CGTCGGTGCA
GCCGTGGACC GGGCGATCGA GGAGGGCGCA CGGGTCGCTG CGCGAGCACA ACTGCCCGAC
GCACCCGGGT ACTTCTACCC GCCGACCGTC CTCACCGACG TGCCCTCTAC CTCCTCGATC
CTGGCTGAGG AGATCTTCGG TCCAGTGGCT CCCATCGCGA CCTGGGACGA CGAACAGGAG
CTGCTTCGCC AAGTGAACGG TACTGAGTAC GGCCTCGCCG CGTACGTCTA CACCGGTCAC
CTCGAACGAG GGCTCCGCCT CGGCGAGCGT ATCGAGGCTG GCATGGTCGG CATCAACCGC
GGCATCGTCT CCGATCCCTC TGCGCCGTTC GGTGGCGTCA AGCAGAGCGG CCTGGGCCGT
GAAGGCGCTC GCGAAGGGCT CCGTGAGTTC CAGGAGACCC AGTACCTGAG TATCAGCTGG
GACGACTGA
 
Protein sequence
MPWAAGREVN ARDRTGTNLV NGNRTFAVED PATLELIAEV ADHGETEARA AVDRAHEAFA 
GWSGTSPRQR SDVLRRAYEL MLRDEGRLSA LIARENGKSL ADAASEVVYA AEFFRWYAEE
VVRTEGSYGE APAGGVRTIV HHKPVGVAAL VIPWNFPAAM ATRKIAPALA AGCTVVLKPA
AETPLTAIAI ADLLAEAGLP AGVVELVTTT HAGAVVTAWL EDDRVRKVSF TGSTGIGRLL
LHQAADRVVN TSMELGGNAA FIVTEDADLV DAVAGAMIAK FRNGGQACTA ANRIYVHRDV
ADAFVALVGA QVEKLSVGAA NDGNMIGPLI SAAAVSRVGA AVDRAIEEGA RVAARAQLPD
APGYFYPPTV LTDVPSTSSI LAEEIFGPVA PIATWDDEQE LLRQVNGTEY GLAAYVYTGH
LERGLRLGER IEAGMVGINR GIVSDPSAPF GGVKQSGLGR EGAREGLREF QETQYLSISW
DD