Gene Noca_4493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4493 
Symbol 
ID4597012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4750871 
End bp4752544 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content74% 
IMG OID639779104 
Productphenylacetic acid degradation protein paaN 
Protein accessionYP_925677 
Protein GI119718712 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02288] phenylacetic acid degradation protein paaN 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAGA CGACCGCGGG AGCCCTCTTC GACCGGCACC GGGCCCGGCT GGACGCCGCC 
GTCGCCGCCT GCGCGAGCCG GGAGTACTAC TCGGCGTTCG ACGAGTCGCC GTCGCCGCGC
GTGTACGGCG AGACCGCGGC CGCCGAGGGC AAGGCCGCGT TCGAGGCCTG GCTCGGCTCG
CCGTTCCCGC TCCCGACCCC GGGTGCCCAG GGCAGCGTCG CGACCGAGCG CTCGCCGTAC
GGCGTGGACC TCGGCGTCTC CTACCCGCGA GCGGTCGACG TCGACGCGCT GCTCTCGGCG
GCGCGGGCCG GCATGAAGGG GTGGCGCGAC GCGGGCCCGG ACGGACGGAC CGGGGTGTGC
CTGGAGATCC TGGCCCGGCT GCACGCGCGC ATCTTCGAGC TCGCCAACGC GGTGCAGCAC
ACCAGCGGGC AGGCGTTCGT GATGGCGTTC CAGGCCGGCG GGGCGCACGC GCTCGACCGC
GCCCTGGAGG CGATCGCCTA TGCGTGGACC GAGATGACCC GCACCCCGGC CACCGCCATC
TGGGAGAAGC CCGGCCGCGG TGGGCCGCTG CGGATGGAGA AGACGTTCAC CGTGGTGCCC
CGCGGGGTCG CGCTGGTGAT CGGCTGCAAC ACCTTCCCCA CCTGGAACTC CTGGCCGGGG
CTGTTCGCGT CGCTGGTCAC CGGCAACCCG GTCGTGGTCA AGCCACACCC GGCGGCCGTG
CTGCCGCTCG CGATCAGCGT CCAGGTGTGC CGCGAGGTGC TGGCCGAGGC CGGCTTCGAC
CCCGACCTCG TCCTCCTCGC CGCCGAGGAG CCCGAGGACC GCCTGGCCGC GACGCTCGCG
GTCCGCCCCG AGGTGCGGCT GATCGACTTC ACCGGCGGGA ACGCCTTCGG CGACTGGCTC
GAGGCCAACG CGGGCCAGGC CGTCGTCTTC ACCGAGAAGG CCGGCGTCAA CACGGTGGTC
GTCGACTCGA CCTCCGACTT CGCGGCGATG TGCCAGAACC TTGCGTTCTC GTTCGCGCTC
TACTCCGGCC AGATGTGCAC CGCCCCGCAG AACGTCTACG TGCCGGCCGC CGGGATCGCG
ACCGAGGACG GCGTGCGCTC GCCCGCCGAG GTGGGCGCCG GGATCGGCGC GGCCCTCCAG
GGGCTGCTCG GGGACGACGC GCGCGCGGTC GAGCTGCTCG GCGGCATCGT CAACGACGGG
GTGCTGGCCC GCCTGGAGAA GGCCGGCCGC GGCCACGTCC TCGTGCCGTC CCGCGAGGTC
GCGCACCCGG CGTACGCCGA CGCGACCGTC CGCACCCCCA TGCTCGTCGG GCTCACCGCG
GAGGACTCCG ACGTGTACGA GTCCGAGTGC TTCGGGCCGG TCGCCTACCT GATCGAGACC
GAGGGCACCG ACCAGTCGAT CGACCTGTTC CGGCACACCG TGCTGGCCCA CGGCGCGATG
ACCGCCGCGG TGTACTCCAC CTCTCCCGAG GTCGTCGAGC AGGTGCGCGA GGCCGCCCTC
GAGGCGGGGG TCGCCCTGTC GGAGAACCTC CTGGGCGGGG TGTTCGTCAA CCAGTCCGCG
GCGTACTCCG ACTTCCACGG CACCGGCGCC AACCCCGCCG CGAACGCGGC GTACACCGAC
GGCGCCTACG TCGCCTCGCG GTTCCGCGTC GTGCAGTCCC GCCGCCACGT CTGA
 
Protein sequence
MTETTAGALF DRHRARLDAA VAACASREYY SAFDESPSPR VYGETAAAEG KAAFEAWLGS 
PFPLPTPGAQ GSVATERSPY GVDLGVSYPR AVDVDALLSA ARAGMKGWRD AGPDGRTGVC
LEILARLHAR IFELANAVQH TSGQAFVMAF QAGGAHALDR ALEAIAYAWT EMTRTPATAI
WEKPGRGGPL RMEKTFTVVP RGVALVIGCN TFPTWNSWPG LFASLVTGNP VVVKPHPAAV
LPLAISVQVC REVLAEAGFD PDLVLLAAEE PEDRLAATLA VRPEVRLIDF TGGNAFGDWL
EANAGQAVVF TEKAGVNTVV VDSTSDFAAM CQNLAFSFAL YSGQMCTAPQ NVYVPAAGIA
TEDGVRSPAE VGAGIGAALQ GLLGDDARAV ELLGGIVNDG VLARLEKAGR GHVLVPSREV
AHPAYADATV RTPMLVGLTA EDSDVYESEC FGPVAYLIET EGTDQSIDLF RHTVLAHGAM
TAAVYSTSPE VVEQVREAAL EAGVALSENL LGGVFVNQSA AYSDFHGTGA NPAANAAYTD
GAYVASRFRV VQSRRHV