Gene Ndas_5307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5307 
Symbol 
ID9249207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp472775 
End bp475144 
Gene Length2370 bp 
Protein Length789 aa 
Translation table11 
GC content70% 
IMG OID 
Productglycosyl transferase family 51 
Protein accessionYP_003683193 
Protein GI297564220 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.93625 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTTCAAA GAATCAGCCA ACTGGTCGGC GTCGGCGTCG TCGCTGGACT CCTCGTCGCC 
GCTCTCGCGC TCCCCGCCGT CGGGGGACTG GGCATCACGG CGCGCAACGT GGCCAGCGGC
TTCCTGGAGA TGCCCAGCAA TCTGGAGACT CCGCCGCCTC CTCAGAGATC GACGATCTAT
GACAGCGAGG GCGGCGTCAT CGCGGAGATC TTCGACCAGA ACCGCGAGCT CGTCGAGATC
GAGGACGTCT CGCCGGTCAT GATCGACGCC ATCCTCGCCA TCGAGGACGC CCACTTCTAC
GAGCACGGCG GCATGGACGT GTCGGGCACG CTGCGCGCCG CGCTCCGCAC GCTCAGCGGC
GACACCCAGG GCGCGTCCTC CATCACCCAG CAGTACGTCA AGAACGTCCA GATCGAGGCG
GCCACCACCC AGGAGGAGCT GGAGGAGGCC TCCGAGACGA CGATCGCCCG CAAGATCCGG
GAGCTGCGCT ACGCCGTCGC CCTGGAACAG CGGATGACCA AGGACGAGAT CCTCGAGGGC
TACCTCAACA TCGCCTACTT CAGCGACGGC GCCTACGGGG TCGAGTCCGC CGCGCAGCAC
TTCTTCCAGG TGCCCGCCAG CGACCTGGAG CTGCACCAGG CCGCCACCAT CGCGGGCCTG
GTGCGCTACC CCTACCTCTA CAATCCCCGC TTCTTCCCCG AGCAGACCAC CGACCGCCGC
AACGTCGTGC TCGACCGCAT GGTGGCCACC GGCGCGATCA CCGAGGCGGA GGCCGAGGAG
GCCAAGCAGG TCCCGATGGA GGTGGAGATC TCCACCCCGC CCAACGGCTG CGTGCCCAGC
GACCAGCCGT TCTTCTGCGA CTACGTGGTC CAGGAGATCG AGAAGGACGA GCGCTTCGGC
CAGAACGAGA CCGAGCGGGC GCGCTGGCTG CGCACCGCCG GACTCCAGAT CCACACCACG
CTCGACCCGC AGACCCAGGT GTCGGCGCAG GAGGCCGTGG ACGAGTGGGT CCCGCGCGAG
AACGAGTCCC GCAAGGTGGC CGCCGAGGTG GTCATCCAGC CGGGCACCGG CCACATCCTG
GGCATGGTGC AGAGCCGCAA CTACGGGCCC GACGAGAGCA AGCTCGGCGA GACCTCCATC
AACTTCGCCA CCGACGCGGA CCGGGGCGGC AGCAGCGGCT TCCAGGCGGG TTCCACGTTC
AAGGCGTTCA CGCTGGCGGC GGCGCTGGCC GAGGGGATGC CGTTCGGCAC CTCGTTCAAC
TCACCGGCCA GCGTGACCGT CTCCGGGCAG CGCAGCTGTA ACGGCGGGAC CCTGAGTCCC
TGGTCCCCCA GCAACTCCGG CGACACCAAC TCCAACACCA CGCACAACAT GATGACCGGG
ACGAGGGCCT CGTCCAACAC CTACTTCGCC CAGCTCCAGG CGCGGGTGGG CTTCTGCGAC
GTGATGGAGA TGGCCGAGAA CCTGGGGCTG CGGCGCGCGG ACGGCACGCT GTTCACCGAG
AACGAGCGCT CGCTGGCCAA CAGCAGCTTC ACGCTGGGCA GCGAGGAGGT CTCCCCGCTG
CGGGTGGCCA ACGCCTACTC GGTCTTCGCC TCGGGCGGGA TCCTGTGCGA GCCGCAGGCC
ATCACCAAGA TCATCGACCA GCACGCGCAC GAGACGATCG AGATCGAGCC CGAGTGCGAG
CGGGTGATCG ACGAGGAGGT GGCCGACGGG GTCAGCTACC TGCTGGAGCA GCCCTTCAAC
GGCGGCACCG CGGCCTCCCT GGGCATCGGC CGTCCGGCGG CGGGCAAGAC CGGTACCACC
GACGGCTCGG CCGCGGCCTG GTTCGCCGGG TTCACCCCGC AGATGGCCGG TGCGGTCTTC
GTCGGCGACC CGCGCGGCCC GCAGCAGTAC CCGCTGCGCA ACGTCACCAT CGGCGACCGG
TACTACGGGG TGGTGTACGG GGCGACGATC CCGGGCCCGA TCTGGCAGGA GACCATGCGC
GGCGCCCACG AGGGCCTGGA CGTCGAGCAG CTGCCGTCGC CGCCGTCGCA GTTCGGCTCC
ACGTCGGCTC CGCGCACGGA GGAGGAGGCC AGCCCGGCCA GCGACGACGG CGGCGTGCCG
GACGTGATCG GCCGGTCCGA GCAGGAGGCG GTGAGCGTCC TGGAGGCCGC GGGCTACACG
GCGAACGTGT CCGCGACCCG GGTCCGCTCC CCCGAGCCCG AGGGCACTGT GGCCGCGGTC
AACCCGGACC CGGGGACGCG GCTGCCGGAG GGGGCCACGG TCAACGTGTT CCTCAGCAGC
GGCGGCGGGA CCGCCAGCGG GCCGGAGGAC GACGAGGACT GGTTCCCGGT CGCGCCCGCG
TCGAACTCCC CCGGCCGGGA GGACGACTGA
 
Protein sequence
MLQRISQLVG VGVVAGLLVA ALALPAVGGL GITARNVASG FLEMPSNLET PPPPQRSTIY 
DSEGGVIAEI FDQNRELVEI EDVSPVMIDA ILAIEDAHFY EHGGMDVSGT LRAALRTLSG
DTQGASSITQ QYVKNVQIEA ATTQEELEEA SETTIARKIR ELRYAVALEQ RMTKDEILEG
YLNIAYFSDG AYGVESAAQH FFQVPASDLE LHQAATIAGL VRYPYLYNPR FFPEQTTDRR
NVVLDRMVAT GAITEAEAEE AKQVPMEVEI STPPNGCVPS DQPFFCDYVV QEIEKDERFG
QNETERARWL RTAGLQIHTT LDPQTQVSAQ EAVDEWVPRE NESRKVAAEV VIQPGTGHIL
GMVQSRNYGP DESKLGETSI NFATDADRGG SSGFQAGSTF KAFTLAAALA EGMPFGTSFN
SPASVTVSGQ RSCNGGTLSP WSPSNSGDTN SNTTHNMMTG TRASSNTYFA QLQARVGFCD
VMEMAENLGL RRADGTLFTE NERSLANSSF TLGSEEVSPL RVANAYSVFA SGGILCEPQA
ITKIIDQHAH ETIEIEPECE RVIDEEVADG VSYLLEQPFN GGTAASLGIG RPAAGKTGTT
DGSAAAWFAG FTPQMAGAVF VGDPRGPQQY PLRNVTIGDR YYGVVYGATI PGPIWQETMR
GAHEGLDVEQ LPSPPSQFGS TSAPRTEEEA SPASDDGGVP DVIGRSEQEA VSVLEAAGYT
ANVSATRVRS PEPEGTVAAV NPDPGTRLPE GATVNVFLSS GGGTASGPED DEDWFPVAPA
SNSPGREDD