Gene Noca_4785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4785 
Symbol 
ID4595382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp102420 
End bp104594 
Gene Length2175 bp 
Protein Length724 aa 
Translation table11 
GC content66% 
IMG OID639772572 
Producthelicase domain-containing protein 
Protein accessionYP_919232 
Protein GI119714090 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0599344 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.582969 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGATGA TCGACCCGCG CCGATTCTCC CGTGGCGCGC TGCTGGACTC CAAGGCACTC 
AAGGACGTCA CCGTCCGACG GCTCAAGACC GACCTGTCCG GCAAGGGCTT CAAGAAGCGC
AAGGTCGACG CCCTCGACTT CACCCCGTCG GACAGTGAGC AGGAGAAGTT CTCTCTGCTG
GACGAGATCG TTACCAAGAG CGCCAAGCAG AACGGCACCA AGCCGACCGG CGACATCGTG
ACGATGTTGT TGAAGAAGAG GTTCTTGTCC AGCCCCTTCG CGTTCGGCAT GACCCTGAGC
CACTACTTGT CCTCGAGGGC CGGGCGTGGC CTGTCCGAGG ACGACTACGA CGACGTCTTC
GGTGAGGGTC AGGCCGACGA GGAAGAAGGC CTGTGGGAGC AGGACGAGGC CGAGCGACTC
CGCGAGTCCA AGGGCTCTGA CCCGTTGGTC GCCGCGGAGC CCGGCCAGCT CGAATCACTT
ATGGAGTGGG GGTTGAGCTA CGAGAGCCGC GCCGACTCCC GCCTCGACCG CCTCATCGGA
TTCCTCGACG CGGTGTGCCG CCCGGACGGC AAGAGCTGGT CCAACGAGCG CGTCGTGATC
TTCACCGAGT ACGCCCACAC CGTCGACTGG CTTACCCGCG TCCTGCGCCA GCGCGGCTAC
GTCGAGGACC GACTGGCCGT GATCCAGGGC TCGACCAAGC CTGAGGACCG CGAATACATT
CGCTCGCAGT TCACCGCCGA TCCGGCCAAG GAGCTGGTCC GCGTCCTGCT CGCCACCGAC
GCCGCCGGTG AGGGCATTGA CCTCCAGACC CACTGCCATC GGCTGGTGAA CTTCGACATC
CCGTTCAACC CGTCCCGGTT GGAGCAGCGC ATCGGCCGCA TCGACCGCTA CGGCCAGACC
GATGAACCCC AGGTCTTCCA CTTCGTCCCC GTCGCAGGCG CCTCTACCTA TGCGGCGGAC
GTGGACTTCA TGTCGCGCAT CGCCCGCAAG ATTGCCCAGG TCCAATACGA CCTCGGCTCG
GCCAACCAAG TCGTCGGCGA GGAGATCCAG ACGCACTTCG CCCGCCGCAC CCCTGCCAAG
GCGAAGGCCA AGGGAGTCGA CACCAACGAG GTCATCAACG CCGCGCTTGC CGGCGGGTTG
GAGCTGAACA CTCGGCTGAC CCAGTTGGAG GTGGGGTACG ACGCCTCCCG CACCGAGATG
CACCTCGACC CCGCCAACCT TCGTCGCGTG GTCGACACCG CCCTGCGGAT CAACCACCAG
TCCCCGCTCG TGCCCAACCA CAAGTTCGCC CAAGACACCG ACGCCGAGGT GTACGACCTC
CCGACCCTGA CCACGGGCTG GCATGACACG CTGCGAGGGC TCGACACTCG ACTCAAGCCC
GGTGTGCAAC GCCCGATCAC GTTTGACGCC AACGCAGCCG AGGGCCGCGA CGATCTTGTT
TACGTTCACC TCGGGCACCC GATCGTCCAG AAGGCGCAGC GCCTGCTGCG CCGATCCCTT
TGGAGCGTCG ATTCGCCACT AAGTCGCGTC ACTGCCGTTG TGGTCGATGA CCTGGAGGAG
TCCTTCGTCG CGGCCGTCAC CCGCATGGTC CTCGTTGGGC GAGGCGGCAT CCGCCTCCAC
GAAGAAGTGT TCCTCGCCGG CGTTCGTGTG AGGGGACGCC GTGCGATGGC CGAGGAAAAG
GCCGAGGCCG CGCTCGACGA CGCACTCGAC CGCGAGCGGC TGGCACTGGC CGACTCGCAG
GTGCGCGACC AACTCTGCGA CCTATGGAAC GTGCCCGACG CACCGCTGCG ACTTCGCTTG
GAAGAGTCGA TGCAGGCTCG CGCAGGACGT CGCCACGAGC TCGTCATGGA ACAGCTCACC
AAGCGGCAAG AGGCCGACAC CCAACGCGCC CACGAGATCT TCGCCGCCTT CCGCACCAAC
CTCCGCGAGT CGCTCGCGGC GCTCAAGGCC GCCGAAGACG AGGCCCAAGG GCAGTTGTTC
TCCGACCCCG ACCAGCAGCG CCAGTGGAGG CGTGACGTCG AGGCGATGAC CCGGCGTCTC
GAAGAGCTTG ACGACGAAGA AGCACGCGAG ATCGCCGCCA TCACCGACCG GTATGCCGGG
GTGAAGCCAC ACACCACTGC CGCAGCGGTC GTGTTTGCGC TGACCCGCTC CGACGCCGAC
GGGTGGATCG ACTGA
 
Protein sequence
MEMIDPRRFS RGALLDSKAL KDVTVRRLKT DLSGKGFKKR KVDALDFTPS DSEQEKFSLL 
DEIVTKSAKQ NGTKPTGDIV TMLLKKRFLS SPFAFGMTLS HYLSSRAGRG LSEDDYDDVF
GEGQADEEEG LWEQDEAERL RESKGSDPLV AAEPGQLESL MEWGLSYESR ADSRLDRLIG
FLDAVCRPDG KSWSNERVVI FTEYAHTVDW LTRVLRQRGY VEDRLAVIQG STKPEDREYI
RSQFTADPAK ELVRVLLATD AAGEGIDLQT HCHRLVNFDI PFNPSRLEQR IGRIDRYGQT
DEPQVFHFVP VAGASTYAAD VDFMSRIARK IAQVQYDLGS ANQVVGEEIQ THFARRTPAK
AKAKGVDTNE VINAALAGGL ELNTRLTQLE VGYDASRTEM HLDPANLRRV VDTALRINHQ
SPLVPNHKFA QDTDAEVYDL PTLTTGWHDT LRGLDTRLKP GVQRPITFDA NAAEGRDDLV
YVHLGHPIVQ KAQRLLRRSL WSVDSPLSRV TAVVVDDLEE SFVAAVTRMV LVGRGGIRLH
EEVFLAGVRV RGRRAMAEEK AEAALDDALD RERLALADSQ VRDQLCDLWN VPDAPLRLRL
EESMQARAGR RHELVMEQLT KRQEADTQRA HEIFAAFRTN LRESLAALKA AEDEAQGQLF
SDPDQQRQWR RDVEAMTRRL EELDDEEARE IAAITDRYAG VKPHTTAAAV VFALTRSDAD
GWID