Gene Ndas_3370 details


Gene Information

Locus tag: Ndas_3370
Symbol:
ID: 9247235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4026201
End bp: 4027304
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: A-factor biosynthesis repeat-containing protein
Protein accession: YP_003681281
Protein GI: 297562307
COG category:
COG ID:
TIGRFAM ID:
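
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of three and,
    by default, that the final codon is a stop codon.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record for Ndas_3370.
length = gene_length_bp(4026201, 4027304)
print(length, protein_length_aa(length))  # 1104 367
```

Under these assumptions the computed values agree with the reported gene length (1104 bp) and protein length (367 aa).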


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGATGC CCGTGCACCA CCTCCTCAAC GGCGCCACCG ACACCGTGAC CGCAGGCACG 
GAGGGTCCGG CCCTGGACTA CGAGCGGACC GTTGACCGCA CCGTCGTCCA CCGGGAGTCG
TTGGCGGAGG TCTTCGTCAC CGACACCCAG CCCCTCGGAG GGGACGCCCA CGCGGCCGCC
GCCCAGCTCC CCCGTTCACA CGCCTACTAC GGCGACCACC TGCTCCGCCC CCGCCGCCAC
GACCCCGTGC TGCTGCTCGA AGCATGCCGA CAGGTGGGGC TGGCCATCGC GCACACCCAC
TACGGCGTCC CCTTCGACCA CAAGTTCGTG CTCACCACCC TGGGCATCAC CATCACGCGC
CCCGAGCTGA TGACGGTGGG GACGGCTCCG TGCGCCCTCC GCATGCTCTG CTCCGTCGGG
GACAAGAGGG TCAAGGAAGG ACGCGTCGTC GGCTACGACG CCAGGTTCCG GCTCTTCGTC
GACGGCACGG AGGTCGGCAA CGCCGTCGTC GGCCTGCGGT TCAAGTCCCC GGCGAGCTAC
GAGGCGCTGC GCCTGCGCAA CCGCTCCGGC GAGCCGGTCC CCTCCACGGA GACCTTCGAC
TTCACCGTCG GCGGGGAGCT CCCCGCCCCC TACCTCGTCG GCCGGTCGAA CGGCGACAAC
GTGGTCCTGA CCGGGCTCAC GGGGGCCGGG GACACCGTGT CGGCCTCCCT GCGCGTGCTG
CCCCAGCACC CGAGCCTGTT CGACCACGCC CAGGACCACC TGCCGGGCAT GGTCCTGATC
GAGGCCGGGC GCCAACTGGC CCTGAACACG CTCCTGGAGG TCCGGGGCAC CTCGCCGGCC
AAGGCCTACC CCACCGAGAT CACCGCCACC TTCACCAGCT TCGGAGAACT GGAGCCCCGG
ACCGAGTTGC GGGCCGTCAC CGCTCCGGCG GGGGCGGAGG GGCCCGAGGA GGAGGGCGTC
TACTACACGC AGGGCGGAAT CGTGGAGTTC CTCGCGCCCA CCGGCTGCCC CGAACCCGCC
CCGACCTCCG TCGAGGTGGA CGTGCTCCAG AGGGGCGCGT CGATCTGCCG GATCGAGGTC
GGCCTGGTCC GCCTCCCCGC GTGA
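
The gene sequence is laid out in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of joining the blocks and computing a GC percentage that could be compared with the reported GC content. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGCAGATGC CCGTGCACCA CCTCCTCAAC GGCGCCACCG ACACCGTGAC CGCAGGCACG"
seq = join_blocks(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the complete gene sequence to `join_blocks` and then to `gc_percent` gives the value for the whole gene.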
 
Protein sequence
MQMPVHHLLN GATDTVTAGT EGPALDYERT VDRTVVHRES LAEVFVTDTQ PLGGDAHAAA 
AQLPRSHAYY GDHLLRPRRH DPVLLLEACR QVGLAIAHTH YGVPFDHKFV LTTLGITITR
PELMTVGTAP CALRMLCSVG DKRVKEGRVV GYDARFRLFV DGTEVGNAVV GLRFKSPASY
EALRLRNRSG EPVPSTETFD FTVGGELPAP YLVGRSNGDN VVLTGLTGAG DTVSASLRVL
PQHPSLFDHA QDHLPGMVLI EAGRQLALNT LLEVRGTSPA KAYPTEITAT FTSFGELEPR
TELRAVTAPA GAEGPEEEGV YYTQGGIVEF LAPTGCPEPA PTSVEVDVLQ RGASICRIEV
GLVRLPA
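
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons. Alternative start codons are not special-cased here, which is harmless for this gene because it begins with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence and drop one trailing stop codon, if present."""
    dna = "".join(dna.split()).upper()  # remove block spacing and line breaks
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# First three blocks and last line of the gene sequence, copied from the record.
print(translate_cds("ATGCAGATGC CCGTGCACCA CCTCCTCAAC"))  # MQMPVHHLLN
print(translate_cds("GGCCTGGTCC GCCTCCCCGC GTGA"))        # GLVRLPA
```

The two fragments translate to the first ten and last seven residues of the protein sequence shown above.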