Gene Ndas_4190 details

Gene Information

Locus tag: Ndas_4190
Symbol:
ID: 9248064
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5004300
End bp: 5005946
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: helicase domain protein
Protein accession: YP_003682089
Protein GI: 297563115
COG category:
COG ID:
TIGRFAM ID:
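
The coordinate and length fields above can be checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the reported gene length, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the final stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(5004300, 5005946)
print(length, protein_length_aa(length))  # prints "1647 548"
```

Both values agree with the Gene Length and Protein Length fields of this record.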


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCTGCC TGATCGTCCA GTCCGACAAG ACGCTCCTGC TGGAGGTCGA CCACGAGCTG 
GCCGGCGAGT GCCGCCGGGC CATCGCCCCC TTCGCCGAAC TCGAACGCGC GCCCGAACAC
GTGCACACCT ACCGGATCAC ACCGCTGGCC CTGTGGAACT CCCGTGCCGC CGGGCACGAC
GCCGAGCAGG TCGTGGACGC GCTGATCCGG TTCTCCAAGT TCCCGGTGCC GCACTCGCTG
CTGGTGGACA TCGCCGAAAC AATGGACCGC TACGGGCGGC TGCGGCTCGT GGGCGACCCG
CGGCACGGTC TGGTCCTGGA GTCCTCCGAC CGCGCCGTGC TGGAGGAGGT CGTCCGCGCC
AAGAAGCTCA AGGGCATGCT GGGCGAGCGG CTGGGCGAGG ACTCGGTCGC CGTGCACCCG
AGCGAGCGCG GCAACCTCAA GCAGGCGCTG CTCAAGATCG GGTGGCCCGC CGAGGACCTG
GCCGGGTACG TGGACGGCGA GGCGCACCCC ATCGACCTCG ACCAGGACGG CTGGGAGCTG
CGCGGCTACC AGAGGGAGGC CGCGGAGAGC TTCCACGCGG GCGGGTCCGG CGTGGTGGTC
CTGCCGTGCG GCGCGGGAAA GACCGTGGTC GGCGCGGCGG CGATGGCCAT GACCGGGGCG
ACCACGCTCA TCCTGGTGAC GAACACGGTG TCGGTGCACC AGTGGAAGAC CGAGCTGCTG
CGGCGCACCT CGCTGACCGA GGACGAGATC GGCGAGTACT CGGGCACCCG CAAGGAGATC
CGCCCCGTCA CCATCGCGAC CTACCAGGTC ATGGCGGCCA GGCGCAAGGG CGTGTACACG
CACCTGGAGC TGTTCGACGC CCGCGACTGG GGCCTGGTCG TCTACGACGA GGTGCACCTG
CTGCCCGCGC CGATCTTCCG GATGACCGCC GACCTCCAGG CCCGGCGCCG ACTGGGCCTG
ACCGCGACCC TGGTGCGCGA GGACGGCCGC GAGGGCGACG TGTTCTCGCT GATCGGCCCC
AAGCGCTACG ACGCGCCGTG GAAGGACATG GAGAACCAGG GGTGGATCGC CCCCGCGGAC
TGCGTCGAGG TGCGGGTGGA CCTGTCGGAG GCCGAGCGGC TGGCGTACGC GACCGCCGAG
CCGGAGGACC GCTACCGGTT CTGCGCCTCC TCCGAGACCA AGACCTCGGT GGTCCGGGAG
ATCGTGGAGC GCCACCCCGA TGAGCAGGTG CTGGTCATCG GCTCCTACAT CGACCAGCTG
GACGAGCTCG GCGCGTCGCT GGGCGCGCCG GTGATCAAGG GCGAGACCCG CAACAAGGAG
CGCGAGCGCC TCTTCGACGC CTTCCGCTCC GGGGACCTGC GGACGCTGGT GGTGTCGAAG
GTCGCGAACT TCTCCATCGA CCTGCCCGAG GCCGGTGTCG CGGTGCAGGT GTCGGGGTCC
TTCGGCTCGC GGCAGGAGGA GGCGCAGCGG CTCGGCCGGG TACTGCGCCC CAAGGCCGAC
GGCCGCGCCG CCCGCTTCTA CGCGGTGGTG GCGCGCGACA CCCTCGACCA GGACTACGCC
GCGCACCGGC AGCGCTTCCT GGCCGAGCAG GGGTACGCCT ACCGGATCAC CGACGCCGGC
GACCTGCTGT CGGGCGAGGA GATCTGA
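
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch that assumes the sequence is pasted as shown here, with spaces between 10-base blocks and line breaks every 60 bases. The helper name `gc_percent` is hypothetical, and how the source database rounds the value is not stated in this record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Small illustrative inputs, not taken from this record.
print(gc_percent("GGCC"))          # prints "100.0"
print(gc_percent("AT GC\nat gc"))  # prints "50.0"
```

Passing the full gene sequence above to `gc_percent` should give a value that can be compared with the reported GC content.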
 
Protein sequence
MSCLIVQSDK TLLLEVDHEL AGECRRAIAP FAELERAPEH VHTYRITPLA LWNSRAAGHD 
AEQVVDALIR FSKFPVPHSL LVDIAETMDR YGRLRLVGDP RHGLVLESSD RAVLEEVVRA
KKLKGMLGER LGEDSVAVHP SERGNLKQAL LKIGWPAEDL AGYVDGEAHP IDLDQDGWEL
RGYQREAAES FHAGGSGVVV LPCGAGKTVV GAAAMAMTGA TTLILVTNTV SVHQWKTELL
RRTSLTEDEI GEYSGTRKEI RPVTIATYQV MAARRKGVYT HLELFDARDW GLVVYDEVHL
LPAPIFRMTA DLQARRRLGL TATLVREDGR EGDVFSLIGP KRYDAPWKDM ENQGWIAPAD
CVEVRVDLSE AERLAYATAE PEDRYRFCAS SETKTSVVRE IVERHPDEQV LVIGSYIDQL
DELGASLGAP VIKGETRNKE RERLFDAFRS GDLRTLVVSK VANFSIDLPE AGVAVQVSGS
FGSRQEEAQR LGRVLRPKAD GRAARFYAVV ARDTLDQDYA AHRQRFLAEQ GYAYRITDAG
DLLSGEEI
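
The protein sequence can be reproduced from the gene sequence using translation table 11 (bacterial, archaeal and plant plastid code). The sketch below is an assumption-labeled illustration, not the database's own pipeline: it builds the codon table from the standard amino-acid string in TCAG order, treats the table 11 initiator codons (TTG, CTG, ATT, ATC, ATA, ATG, GTG) as methionine only at the first position, and stops at the first stop codon. The name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Codons that table 11 allows as translation starts; read as Met when first.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, ignoring whitespace, up to the first stop codon."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        residues.append(amino_acid)
    return "".join(residues)


first_line = "TTGTCCTGCC TGATCGTCCA GTCCGACAAG ACGCTCCTGC TGGAGGTCGA CCACGAGCTG"
print(translate_cds(first_line))  # prints "MSCLIVQSDKTLLLEVDHEL"
```

The gene begins with TTG, which encodes leucine internally but is read as methionine at the start, which is why the protein sequence begins with M.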