Gene Noca_4844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4844 
Symbol 
ID4595440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp173952 
End bp176033 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content68% 
IMG OID639772631 
ProductCoA-binding domain-containing protein 
Protein accessionYP_919291 
Protein GI119714149 
COG category[C] Energy production and conversion 
COG ID[COG1042] Acyl-CoA synthetase (NDP forming) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0385105 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTGC TGAACCACCC CCTCAGCGCG GTGTTCCGAC CGCGCCGGGT CGCCCTCGTC 
GGCGCCTCCG ACCGGCCGGG AAGCGCGGGT GCGCTGTTCT GGGAGAACCT CTCCGGATTC
ACCGGAGAGG TTCTCCCAGT CAACGGATCT GGACGCGCAG TGGCAGGCGT CCGGGCCTAT
CGAAGCCTCA CCGAGATCCC GGGGTCGATC GACCTCGCCG TGCTGGTCGT GCCCGCGTCT
GCCGTCGCCG CGGTCGTGCG AGATGCCGGT GCAAAGGGCA TCCCGGCTTG TGTAGTCATC
ACCTCGGGTT TCGCCGAGGC CGGGCCGGAG GGTGAGCGGC TGCAGGCCGA GGTCATGGCC
ATCGCCCGTG CCGGCGGTGT CCGCATCGTC GGACCCAACT GCTTCGGCGT CCAGAACGCT
GAGGTCGGCT TCAATGCCTC CCTGTCACCG CGTCTGACGG CGAGCGCCGG CTCGCTGAGC
CTGGTCACCC AATCGGGCGC CTACGGTATG GCGCTGCATT CCTTGGCCCA GGACGAGAAC
CTGCGCTTCA ACAAGGTGTA CGCCACCGGG AACAAGGCGG ACCTCACCGA CACAGAGCTC
CTTGACTACC TGGACTCCGA GGGCGGCTCC GGGCCGATCT GCTTCTTCCT GGAGTCCCTG
CCGGACGGTC GGGCCTTCTT CGAAGCGGCA CGTCGGGTCA CGCGCAACAG GCCCGTGATC
GTCTGCCGGA CTGGGAGGTC CACTGCCGGT AGCCGCGCCG CCTGCTCACA CACAGCGAGC
CTCGCAGGCC AGCAACGCGT CTGGTCCGCG GCGTTCGCGC AGGCCGGCGT GATCGTGACC
AAGTCCGGTC TGGAGATGCT CGACGCGGCC CGTGCACTCG ACGGTGGCCA TCATCCGGCG
GGGCTCCGGG TCGGGATCGT GACGAACTCC GGCGGCACAG GAGTCGAACT GACCGACCTC
CTCGCGGACG AAGGACTGGA CGTTCCGCTG CTCAGCGCAC CGTTGCAAAC CAAGCTCGCA
GAGACGCTTC CCGACCTCGC CAGCCCCAGC AACCCCGTCG ACCTGACCAC AGCCTGGCAA
CGCTTCACCG AGCTCTACCC ATTCGCCATC GAACAACTCG TCCGAAGCGG CGAGATCGAC
GCGGTGATCG CCGTACTGCT GCAGCGCTCG GCCGATGAAG GCGTCGCAAC CGCCGTAGCT
GAGAAGGTGC AGCAACTGCG CGCCGAGGGC GTCACCGTGC CCGTCTACGT CTGCTGGGTC
GCCGCCCACG ACGACCGCGC GAACGCGGCG CCACTGCGCT GCGCGGGCGT TCCCGTCTTC
GAGTGGCCCG AACGAACCGC GCGTGCGGTC GGGCACGCCG CGCGCTATGG AGCCTGGCTG
CGGCCGGAAG GCTCGTGGCT GACGCTCGAC CGCAGGGTAA GTGTCGAGCC AGGGCTCGGG
GAACAAGCCA CAGGGTGGCT CCCGGTCGGC GAGTCCGCGT TCTTGTTGGC CGAGCACGGA
CTTCCGCTGG TCCCGTGGTC CCTGTGCGGG AGCGTCGCCG ATGCGGTGCA GGTAGCGGAG
TCGTTCGGTT ACCCGGTGGT CCTCAAGGCC ATCCATGCAG ACTTGCTGCA CAAGTCTGAT
GCCGACGGAG TTCGCCTTGG ACTCACTGGA CCCGACGAAG TCGCTGCTGC AGGCGAATTC
CTCCTGAAGT TCCGCGAGGG CACAGAACTG CTCGTGCAGC AGCAACACAG CGGCGTCGAG
GTCATCGTGG GCGGAGTGCG CGACGCTGAG TTCGGCCCGG TCGTGCTCGT CGGCATGGGG
GGTGTCGACG TGGAAGTCCA GGACGATGTT GTCCTCGCCC TTGCCCCTCT TCTGCTCCCC
GAAGCCGAGG CGATGGTGCG GCGACTCAGG GGTGCGGCGG CACTGCTCGG GGCGCGAAGG
CCGGCAATCG ACCTGACCTC CCTGGCTGAT GTCGTGTGCC GTCTAGGCCG GCTCATGGTC
GAGCATCCGG ACATCGAGGA GGTTGATCTC AACCCCGTTC TCGCGCGCCC CGACGGGTGC
ACGGTCGTCG ACTGGCGAGT TCGGGTGCGT TCTTCGTCCT GA
 
Protein sequence
MRLLNHPLSA VFRPRRVALV GASDRPGSAG ALFWENLSGF TGEVLPVNGS GRAVAGVRAY 
RSLTEIPGSI DLAVLVVPAS AVAAVVRDAG AKGIPACVVI TSGFAEAGPE GERLQAEVMA
IARAGGVRIV GPNCFGVQNA EVGFNASLSP RLTASAGSLS LVTQSGAYGM ALHSLAQDEN
LRFNKVYATG NKADLTDTEL LDYLDSEGGS GPICFFLESL PDGRAFFEAA RRVTRNRPVI
VCRTGRSTAG SRAACSHTAS LAGQQRVWSA AFAQAGVIVT KSGLEMLDAA RALDGGHHPA
GLRVGIVTNS GGTGVELTDL LADEGLDVPL LSAPLQTKLA ETLPDLASPS NPVDLTTAWQ
RFTELYPFAI EQLVRSGEID AVIAVLLQRS ADEGVATAVA EKVQQLRAEG VTVPVYVCWV
AAHDDRANAA PLRCAGVPVF EWPERTARAV GHAARYGAWL RPEGSWLTLD RRVSVEPGLG
EQATGWLPVG ESAFLLAEHG LPLVPWSLCG SVADAVQVAE SFGYPVVLKA IHADLLHKSD
ADGVRLGLTG PDEVAAAGEF LLKFREGTEL LVQQQHSGVE VIVGGVRDAE FGPVVLVGMG
GVDVEVQDDV VLALAPLLLP EAEAMVRRLR GAAALLGARR PAIDLTSLAD VVCRLGRLMV
EHPDIEEVDL NPVLARPDGC TVVDWRVRVR SSS