Gene Ndas_3707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3707 
Symbol 
ID9247576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4450272 
End bp4451678 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content75% 
IMG OID 
Productglycosyl transferase group 1 
Protein accessionYP_003681611 
Protein GI297562637 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.271047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCTTCC AGAAGGTCCA CCGGATCCGC CAGGAGGCGG AACCGGGCCT GCCCGCGGTC 
ACCGAGGCCC CGGCCGCCGT CATCCCCGTC CAGGTCCCGA CCGGCACGGG CGCACCCGCC
GCCCCGGACG TCCCCGCCGA CCTCGGGCCG CGCCTGGACG GGGTCCGCCT GGTCATGGTC
AACTGGCGCG ACCCCTGGCA GTCCACCGCG GGGGGAGCCG AGGAGTACTC CTGGCGGATC
AGCCGCCACC TGGCCGAACG CGGTGCCATC GTCACCTTCC TCACCAGCCG CGAGCCCCAC
CAGGCGCGCG TGGAGACCAG GGACGGGATC GTCATCCGCC GCATGGGCGG CAGGTTCACC
GTCTACCCGC GCGTCATGGC GTGGCTGGCC CTGTGGCGCC GCGAGTACCA GCTCGCCTTC
GACTGCATGA ACGGCGTCCC CTTCCTGTGC CGCCTCGTCC TGCGCCGCAG CACGCGGGTG
GTCAGCGTCG TCCACCACGT CCACGACCTC CAGTTCAACG CCTACTTCCC GGCGCCCGTC
GCCTGGCTGG GCCGCACGCT GGAGTCGGTC GTGGCCTCGC GCGTCTACCG CCGCTGCACC
ACCGTCACCG TCTCGGAGTC CTCCCGCCGG GCCATGCGCG AGAAGCTGGG CTGGCGCGCG
CCGATCGAGA TCATCCACAA CGGCGGACTC CCCGGGCCCC AGAAGCCCCT CGACGACGCG
CCCGCCCCCG CCGACATGGG CCACCCGGCC GTGGTCAGCC TGGGCCGCCT GGTCGTCCAG
AAGCGGGTCT CGCGGGTGGT CGACCTCGCC CGCGCGCTGC GCGAGGAGCA CCCCGACCTG
AAGGTGCACA TCATCGGCCG CGGCCCCGAG GGCGAACCCC TGGCGGAGCA GGTCGCCCGT
GACGGCACGG GCGACCGCGT GCGCCTGCAC GGCTTCCTGC CCGAGGAGGA CAAGAACAGC
GTCCTGGCCT CGTGCCACCT CCATGTCACC GCCTCCGAGT TCGAGGGCTG GGGCCTGACC
GTCATCGAGG CGGCCCGCCT CGGCGTGCCC ACCGTGGCCT ACGATGTGGA CGGACTCCGC
GACTCGGTCC GCGACGGCGA GACCGGGTGG CTCGTGCGCG AGGGCGAGGA ACTCGCCGAC
GTGGTCGCGC GCGCCCTGGA GGAGCTGTCC GACCCCCGCC GGGCCGAAGC CGTCCGCCGC
GCCTGCCGCG CGTGGGCGTC CCGGTTCACG TGGGAGGCCA GCGGCGCGCG GATGACCCGG
CTCGTCGCGA GGGAGCTGGG CCTGCCCGGC GCCGCGGAAC CGGACGCCCC CGCCACCGAC
GCCCCCGTCC CCGACCCCCT CACCTCCGAC GACACGGGCG CCCCCGCCGC CCGGAACACG
GCAGCAGGCA GGAAAGCGAA GACGTGA
 
Protein sequence
MVFQKVHRIR QEAEPGLPAV TEAPAAVIPV QVPTGTGAPA APDVPADLGP RLDGVRLVMV 
NWRDPWQSTA GGAEEYSWRI SRHLAERGAI VTFLTSREPH QARVETRDGI VIRRMGGRFT
VYPRVMAWLA LWRREYQLAF DCMNGVPFLC RLVLRRSTRV VSVVHHVHDL QFNAYFPAPV
AWLGRTLESV VASRVYRRCT TVTVSESSRR AMREKLGWRA PIEIIHNGGL PGPQKPLDDA
PAPADMGHPA VVSLGRLVVQ KRVSRVVDLA RALREEHPDL KVHIIGRGPE GEPLAEQVAR
DGTGDRVRLH GFLPEEDKNS VLASCHLHVT ASEFEGWGLT VIEAARLGVP TVAYDVDGLR
DSVRDGETGW LVREGEELAD VVARALEELS DPRRAEAVRR ACRAWASRFT WEASGARMTR
LVARELGLPG AAEPDAPATD APVPDPLTSD DTGAPAARNT AAGRKAKT