Gene Noca_1220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1220 
Symbol 
ID4599269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1292731 
End bp1294377 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content69% 
IMG OID639775814 
Productdelta-1-pyrroline-5-carboxylate dehydrogenase 
Protein accessionYP_922421 
Protein GI119715456 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01236] delta-1-pyrroline-5-carboxylate dehydrogenase, group 1 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATACGC TCGCAGGCAT GGACGCTGTG ACCCACCCCC CGGCTCCGGT CAACGAGCCG 
AACCTGACCT ACGCACCCGG CACTCCCGAG CGGGAGTCGC TGCTGGTCGA GATCGAGCGG
CTCGAGAAGA GACAGAAGAG CCTGCGGGCC TACATCGGCG GGCGGTGGAA GGCCGGGGGC
GGCGCGGAGG TCGCGGTGGT GCAGCCGCAC GACCACCAGC ACGTGCTCGG GGTGCTGAAG
AACGCCACCC AGGCCGACGC GCGGGCGGCG GTGAAGGCTG CGGGCGAGGC GGCGCCACAG
TGGCGGGCGA TGGACTTCGA CGACCGGGCA GCCATCCTGC TCAAGGCCGC GGAGCTCCTG
GCCGGGCCGT GGCGCCAGCG GCTGAACGCC GCGACGGTGC TGGGGCAGTC CAAGACGCCG
TTCCAGGCGG AGATCGACGC GGCGTGCGAG CTGATCGACT TCTGGCGCTA CAACGTGCAC
TACGCGCGGG AGATCCTGGC CGACCAGCCG ATCGCGAACA GCCGGGGGAT CTGGAACCGG
ACCGACCACC GGCCCCTGGA GGGGTTCGTC TACGCGATCA CGCCGTTCAA CTTCACCGCG
ATCGCGGGCA ACCTGCCCAC GGCGCCGGCG TTGATGGGCA ACACGGTGAT CTGGAAGCCC
TCGCCGACCC AGCAGCTGGC CGCGTCGTTG ACCATGGAGC TGCTCGAGGA GGCCGGCCTG
CCGCCCGGCG TGATCAACAT GCTTCCCGGC GACGGCATCG ACGTCTCCGC GGTGGCGCTG
GCCCACCCGG ACCTGGCCGG GATCCACTTC ACCGGCTCCA CCCCGACCTT CCAGCACCTG
TGGCGCGAGG TCGGCAACAA CATCGAGAAC TACCGGTCCT ACCCGCGGAT CGTGGGGGAG
ACCGGCGGCA AGGACTTCAT CGTCGCCCAT GCCTCGGCCG ACCCCGACGT GCTGCGCACC
GCGATGATCC GCGGCGCCTT CGAGTTCCAG GGTCAGAAGT GCTCGGCCGC CTCTCGCGCG
TACGTCGCCC GGTCGGTGTG GACCAGGATG AAGGACTCCT TCCTCGCCGA GATCGAGTCG
ATCACCGTCG GGGACCCCAC CGACTTCTCC AACTTCATGG GCGCGGTCAT CGACGAGCGA
GCCTTCGCCA AGCACCGGAA GGCGATCGAG CGGGCCAAGC GGTCCCGCAA GCTCGACATC
GTGGTGGGTG GCACCCTCGA CGACTCGGCC GGCTGGTTCG TGGACCCGAC CGTGGTCGAG
GGCGGGGACC CGACCGACGA GATGTTCACG ACCGAGTACT TCGGCCCGAT CCTCGCGGTG
CATGTCTTCG AGGACGGCGA CTTCGAGAAG GTCGTTCGGG GCATGGAGTC GATCGCCCCT
TACGCGCTGA CCGGGTCCGT CATCGCCCAG GACCGCCGCG CGATCGCCTG GGCCCAGGAC
GAGCTCCGCT TCGCCGCCGG CAACTTCTAC ATCAACGACA AGCCCACTGG CGCCGTCGTG
GGCCAACAGC CCTTCGGGGG TGGCCGCGCG TCCGGCACCA ACGACAAGGC CGGCGCGGCG
GTCAACCTCC TGCGCTGGAC GTCGCCGCGC TCGATCAAGG AGACTTTCGT TCCGCCGACC
GACTACCGCT ACCCCTACTT GGGCTGA
 
Protein sequence
MDTLAGMDAV THPPAPVNEP NLTYAPGTPE RESLLVEIER LEKRQKSLRA YIGGRWKAGG 
GAEVAVVQPH DHQHVLGVLK NATQADARAA VKAAGEAAPQ WRAMDFDDRA AILLKAAELL
AGPWRQRLNA ATVLGQSKTP FQAEIDAACE LIDFWRYNVH YAREILADQP IANSRGIWNR
TDHRPLEGFV YAITPFNFTA IAGNLPTAPA LMGNTVIWKP SPTQQLAASL TMELLEEAGL
PPGVINMLPG DGIDVSAVAL AHPDLAGIHF TGSTPTFQHL WREVGNNIEN YRSYPRIVGE
TGGKDFIVAH ASADPDVLRT AMIRGAFEFQ GQKCSAASRA YVARSVWTRM KDSFLAEIES
ITVGDPTDFS NFMGAVIDER AFAKHRKAIE RAKRSRKLDI VVGGTLDDSA GWFVDPTVVE
GGDPTDEMFT TEYFGPILAV HVFEDGDFEK VVRGMESIAP YALTGSVIAQ DRRAIAWAQD
ELRFAAGNFY INDKPTGAVV GQQPFGGGRA SGTNDKAGAA VNLLRWTSPR SIKETFVPPT
DYRYPYLG