Gene Noca_3947 details

Gene Information

Locus tag: Noca_3947
Symbol:
ID: 4598082
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4157563
End bp: 4159203
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 71%
IMG OID: 639778552
Product: helicase domain-containing protein
Protein accession: YP_925131
Protein GI: 119718166
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
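
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4157563
END_BP = 4159203
GENE_LENGTH_BP = 1641
PROTEIN_LENGTH_AA = 546


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length and protein length agree with the listed coordinates.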


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACGACG GCCCCCTGAT CGTCCAGTCG GACAAGACGC TGCTGCTCGA GGTCGACCAC 
GAGCGCGCCG CCGACTGCCG CAAGGCGATC GCGCCGTTCG CGGAGCTCGA GCGCTCCCCC
GAGCACGTCC ACACCTACCG GCTGACGCCG CTGGGCCTCT GGAACGCCCG CGCCGCCGGG
CACGACGCCG AGCAGGTCGT CGACACGCTG CTGGAGTACA GCCGATACGC CGTACCGCAC
GCGTTGCTGG TCGACGTCGC CGAGACGATG GCGCGCTACG GCCGGCTGAG GCTGGAGAAG
CACCCCGTCC ACGGGCTGGT GCTGGTCAGC ACCGACCGGC CGGTGCTCGA GGAGGTGCTG
CGCGCGAAGA AGGTCGCGGG GATGCTCGGC GCGCGCATCG ACGAGGACAC GGTCGTCGTG
CACGCCTCCG AGCGGGGCAA CCTCAAGCAG GCGCTGCTCA AGCTCGGCTG GCCGGCGGAG
GACTACGCGG GGTACGTCGA CGGCGAGGCC CACCCGATCG CGCTCGACGA GACCGCGTGG
CACCTGCGCG CCTACCAGCG CGAGGCGGCC GAGTCGTTCT GGCACGGCGG CTCCGGGGTC
GTCGTCCTCC CCTGCGGGGC CGGCAAGACG CTGGTCGGTG CGGCCGCCAT GGCCGAGGCG
CAGGCGACCA CGCTGATCCT GGTCACCAAC ACCGTGTCGG CGCGGCAGTG GAAGGACGAG
CTGGTCCGGC GTACCTCCCT GACGCCAGCC GAGATCGGCG AGTACTCCGG GGCGGTCAAG
GAGATCCGGC CGGTCACCAT CGCGACGTAC CAGGTGATGA CGACCAAGCG GAAGGGCGTC
TACCCCCATC TCGAGCTGCT CGACGCCCGG GACTGGGGGC TGATCGTCTA CGACGAGGTG
CACCTGCTGC CGGCGCCGAT CTTCCGGATG ACCGCGAACC TGCAGGCCCG GCGCCGGCTC
GGCCTGACCG CGACCCTGGT GCGCGAGGAC GGCCGCGAGG GCGACGTGTT CTCGCTGATC
GGGCCGAAGC GGTACGACGC GCCCTGGAAG GACATCGAGG CGCAGGGCTG GATCGCCCCT
GCCGACTGCG TCGAGGTGCG GGTCACGCTG CCCTCGGGCG AGCGGCTGGC GTACGCGACC
GCGGAGCCCG AGGAGCGCTA TCGGCTCGCG TCCTGCACCC ACCACAAGAT CGACGTCGTC
GAGTCACTGG TCGCAGCCCA CCCGGGCCAG CCGACGCTGG TGATCGGTCA GTACATCGAG
CAGCTCGACG AGCTGGCTCT GGCCCTCGAC GCGCCGGTGA TCAAGGGCGA GACGAAGGTC
GCCGAGCGGC AGCGGCTGTT CGACGCCTTC CGGCACGGCG AGATCGGGCT GCTGGTCGTC
TCCAAGGTCG CGAACTTCTC CATCGACCTG CCCTCCGCCG AGGTCGCGAT CCAGGTCTCC
GGATCCTTCG GCTCCCGCCA GGAGGAGGCC CAGCGCCTGG GCCGGCTGCT GCGGCCCAAG
ACCGAGGGTC GCACCGCGCA CTTCTACACG ATCGTGTCCC GCGACACCGT CGACGCCGAG
TTCGCCCAGA ACCGGCAGCG CTTCCTCGCC GAGCAGGGCT ACGCCTACCG GATCGTGGAT
GCGGAGGACC TGCCGGCGTA G
 
Protein sequence
MNDGPLIVQS DKTLLLEVDH ERAADCRKAI APFAELERSP EHVHTYRLTP LGLWNARAAG 
HDAEQVVDTL LEYSRYAVPH ALLVDVAETM ARYGRLRLEK HPVHGLVLVS TDRPVLEEVL
RAKKVAGMLG ARIDEDTVVV HASERGNLKQ ALLKLGWPAE DYAGYVDGEA HPIALDETAW
HLRAYQREAA ESFWHGGSGV VVLPCGAGKT LVGAAAMAEA QATTLILVTN TVSARQWKDE
LVRRTSLTPA EIGEYSGAVK EIRPVTIATY QVMTTKRKGV YPHLELLDAR DWGLIVYDEV
HLLPAPIFRM TANLQARRRL GLTATLVRED GREGDVFSLI GPKRYDAPWK DIEAQGWIAP
ADCVEVRVTL PSGERLAYAT AEPEERYRLA SCTHHKIDVV ESLVAAHPGQ PTLVIGQYIE
QLDELALALD APVIKGETKV AERQRLFDAF RHGEIGLLVV SKVANFSIDL PSAEVAIQVS
GSFGSRQEEA QRLGRLLRPK TEGRTAHFYT IVSRDTVDAE FAQNRQRFLA EQGYAYRIVD
AEDLPA
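
The gene sequence starts with GTG while the protein sequence starts with M. This is consistent with translation table 11, where GTG is one of the alternative start codons and an initiating codon is read as methionine. The sketch below illustrates this on the first and last rows of the gene sequence. It is a hand-written illustration rather than the database's own tooling: `translate_cds` is a hypothetical helper, and the start-codon set is the one listed for NCBI translation table 11.

```python
# Standard codon assignments, which table 11 shares for elongation.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS (whitespace allowed) up to the first stop codon.

    The first codon is read as 'M' if it is a table 11 start codon.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_row = ("GTGAACGACG GCCCCCTGAT CGTCCAGTCG "
                 "GACAAGACGC TGCTGCTCGA GGTCGACCAC")
    print(translate_cds(first_row))
```

Passing only the final row of the gene sequence (`GCGGAGGACC TGCCGGCGTA G`) gives the last six residues of the protein, since that row ends with the TAG stop codon. Its first codon, GCG, is not a start codon, so it is translated normally.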