Gene Ndas_4845 details


Gene Information

Locus tag: Ndas_4845
Symbol:
ID: 9248731
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5738917
End bp: 5741853
Gene Length: 2937 bp
Protein Length: 978 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: glycosyl transferase family 51
Protein accession: YP_003682734
Protein GI: 297563760
COG category:
COG ID:
TIGRFAM ID:
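
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5738917
end_bp = 5741853

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
assert gene_length_bp % 3 == 0

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints "2937 978"
```

Both results agree with the Gene Length and Protein Length fields listed in the record.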


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.398226
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTACCG AACGACGACG GAACGCCGAT GGCGGCCGTC GCCGGGCCGA CGGCCCGGCC 
GACGGCTCGC GCGGCGGCGG CGGACGACGC CGGGCCGACG GCCCGCCCCG GGACGAACCC
TCCGGAGGCC GCCGCCGCCC GGACGCCGGT GACGGCTTCT GGGGGGACGA CACCGGGCCC
GAGCGCCCGC GCCGCCGACG CCCCCCGGAG CAGGATGCCC GTTCCGAGGG CCGCCGCCGC
TCCGAGGGCG ACCGCCCCCG CCGCCCGGAG GGACAGCGCC GCGCCCCCGC CGACGGTGAC
CGCCCCCGCC GCCGCCCCGA GGGCGAGCGC CCCCGGCGGC CCCAGGACGG CGAGCGCTCC
GGCCACCGTT CCGAGGGACA GCGCCGCCGC CCGGACGAGG ACCCCCGCGG CCGCCGCAGG
GCGGCCGGTT CCGGACCGCG CGACCCCCGG GACCGCGACC CCCGCGACCC CCGCGACCGC
GGCCGTGACC GTGAGAACGC CGCCCGCTCC GAGGGGCGGC GAGCCCCGGG CCGCGGCTCC
CGCGCCGCCG CCGGCGGCGG CAGAGGAGGC CGGGGAGGCG GCCGGCGCCG CTCCCGCGAC
GAGGAGCCGG ACGACCGCCC CTGGTTCAAG CGCTTCCTGT CCAAGGCGTG GAAACCCGCC
CTGGCCGTCT TCGGCCTGAT GATCATCGGC GGCGTGGCCG CCTTCGCGAT CCTGTACGCC
ATGGCCCCCA ACGTGAACGA GCTCGAGGCC AAGTCGGACG CCAACCTGTC CGCCACCCAG
ATCATGTGGG CTCCGGAGGG CGACGCCGAG CCCGAGGTCG CGGTCACCAC GGGCGAGGTC
AGGCGGGTCG AGGTCCAGAG CGAGGACATC CCGCAGACGG TCATCAACGG TGTGCTCGCC
GCCGAGCAGC GCACCTTCTA CACCGACCCC GGCATCAGCA TCTCCGGGAT CGGGCGCGCC
CTCCTCTCCG GCGGCGAGGC GGGCGGCGGT TCCACCATCA CCCAGCAGAT GGCGCGGAAC
TACTACAGCG ACCTGGAGAT CGAGAACCAG TACATCCGCA AGATCCGCGA GATCTTCATC
TCCATCAAGC TCAACCAGCA GATGGAGAAG GAGGAGATCC TCTCCACCTA CCTCAACACC
ATCTACTTCG GCCGCAACGC CATGGGCATC CAGATGGCCG CCCAGTCCTA CTTCGGCAAG
GACCTGGACG AGCTGACCGA CGCCGAGGGC GCCTTCCTGG GCATCATCAT CCAGATGCCC
AGCAACTTCC AGGGCGAGAT GGGCGACTGG ACCAGGACCT ACCTCAACGA GGAGCGCTGG
CCCTACGTGC AGAACCAGCT GGCGCTCATG CACGAGGAGG ACCCCGACCT CGGCCTGCCC
AGGGCCGAGG CCGAGGCCCT GGAGATCCCC GAGCTCGTCC CGTACGGCAC CGAGGAGGGC
GAGGAGGAGC AGGAGTACGA CCCCAAGTTC GACTACGTCC GCCAGGCGGT CGTCCAGGAG
ATCGAGGAGC GCTACTCCGG GATCACCGCC CAGGACATCG CCACCCAGGG CTTCGTCGTG
CAGACCTCGC TGGACCCCGT GCTCATGGAC GCCGCGCGCA GCTCCTTCGA CGTCCTGCCG
AACACCCCCG AGGACACGAT GATGGGCCTG ACCGCGGTCG ACCCCGCCAC GGGGCAGATC
GTCGCCTTCC ACGGGGGCGA CAACGTGGTC GAGGACGTCA ACAACTCGCT GACCCACCGG
ACGCAGGCGG GCTCGTCCTA CAAGCCCTAC GTGCTCGCCA CCGCGCTGGA GAACGGCATC
AGCCTCAACA GCACGTTCGA CGGCGACTCC CCCCAGGAGT TCCCGGGGCT GCAGAGCGAG
GTCATCAACG CGAGCGGCAG GAGCTGGGGT CCGGTCAACC TGATCCAGTC CACCGCCCAG
TCGATCAACA CCCCGTTCGT GGAGCTGGCG GTGCAGATGA CCCCGGCCGC GGTGGACGAG
CTCGCGGTCC AGGCCGGTGT GGCCGAGGAG CAGACCACGA CCTCCGCCCA GGGCCCGCTC
ATCGCGCTGG GCACCCACCA GGTGAGCGCG CTGGACCAGG CCGAGGGCTA CTCCACCTTC
GCCGCCGGGG GCGTGCACCG TCCGGCGCAC ATGGTCACCG AGCTGCGCAC CTCCGACGGC
CAGGTCATCC CGCCCGACGA CGCCGACCTG CTGGAGAACG GTGAGCAGGT GGTCACCGCG
GGCGTCGCGG CCGACGTCAC CTACGCCATG ACCCAGGTCG TCGAGCCCGG CGGCGGTGGC
GACCAGGCGG CGCTGCCCGA CGGGCGCCCC GTCGCGGGCA AGACCGGTAC CTCCAGCGAC
GCGGTCTCCG CGTGGTTCGT GGGATACACG CCGCAGCTCG TCGCCGCGGT GGGCCTGAGC
CGCGCGGACG CCAACCAGGC CCTGGAGTTC GACGGCCAGA CCGCGGGGGA GATCTTCGGT
GGCACCACCT CGGCCAACGT GTGGCGCGAG TTCATGACGA CCGCCATGGA GGGCGTCGAA
CCGGCGCAGT TCCCGCCGCC CGCCTACGTC GGTACGGAGC AGAGCTTCGT GCCCACGCCC
TCCCCGAGCG CCGAGGAGAG CGACGAGCCG AGCGACGAGC CCTCGGTCGA GGAGTCCCCG
AGCGAGAGCC CGGCGGAGTG CGACCCGTCC ATGCCGGACT CCCCGGAGTG CCAGGACCAG
CAGACCGATG AGCCCTGTGA GCCCGGGTGG GGCCGGGACT GCCCGGAGAG CCCCGGCTCG
GGGGACCAGG AGGAGTGCGG TGGCTGGGGC CAGCCCGCCT GCGAGGACAC CGATCCCAGC
GGTGAGGCCT CCGACCCCGA GCAGGACGGC GGGGAGCAGG GCGGCGGGAT CTTCGGGAGG
ACGACCAACA CCGGCGGCGA GCCCGGAAGG TTGGTGATCC TCGGCCGGAA CGACTGA
 
Protein sequence
MSTERRRNAD GGRRRADGPA DGSRGGGGRR RADGPPRDEP SGGRRRPDAG DGFWGDDTGP 
ERPRRRRPPE QDARSEGRRR SEGDRPRRPE GQRRAPADGD RPRRRPEGER PRRPQDGERS
GHRSEGQRRR PDEDPRGRRR AAGSGPRDPR DRDPRDPRDR GRDRENAARS EGRRAPGRGS
RAAAGGGRGG RGGGRRRSRD EEPDDRPWFK RFLSKAWKPA LAVFGLMIIG GVAAFAILYA
MAPNVNELEA KSDANLSATQ IMWAPEGDAE PEVAVTTGEV RRVEVQSEDI PQTVINGVLA
AEQRTFYTDP GISISGIGRA LLSGGEAGGG STITQQMARN YYSDLEIENQ YIRKIREIFI
SIKLNQQMEK EEILSTYLNT IYFGRNAMGI QMAAQSYFGK DLDELTDAEG AFLGIIIQMP
SNFQGEMGDW TRTYLNEERW PYVQNQLALM HEEDPDLGLP RAEAEALEIP ELVPYGTEEG
EEEQEYDPKF DYVRQAVVQE IEERYSGITA QDIATQGFVV QTSLDPVLMD AARSSFDVLP
NTPEDTMMGL TAVDPATGQI VAFHGGDNVV EDVNNSLTHR TQAGSSYKPY VLATALENGI
SLNSTFDGDS PQEFPGLQSE VINASGRSWG PVNLIQSTAQ SINTPFVELA VQMTPAAVDE
LAVQAGVAEE QTTTSAQGPL IALGTHQVSA LDQAEGYSTF AAGGVHRPAH MVTELRTSDG
QVIPPDDADL LENGEQVVTA GVAADVTYAM TQVVEPGGGG DQAALPDGRP VAGKTGTSSD
AVSAWFVGYT PQLVAAVGLS RADANQALEF DGQTAGEIFG GTTSANVWRE FMTTAMEGVE
PAQFPPPAYV GTEQSFVPTP SPSAEESDEP SDEPSVEESP SESPAECDPS MPDSPECQDQ
QTDEPCEPGW GRDCPESPGS GDQEECGGWG QPACEDTDPS GEASDPEQDG GEQGGGIFGR
TTNTGGEPGR LVILGRND
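
Both sequences are printed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before use in most tools. The sketch below shows one way to do that, using only the first and last lines of the gene sequence as sample input. The helper name `join_blocks` is hypothetical. The checks reflect translation table 11, in which GTG can act as a start codon and TGA is a stop codon.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) from a block-formatted sequence."""
    return "".join(text.split())


# First and last lines of the gene sequence, copied from the record.
first_line = "GTGAGTACCG AACGACGACG GAACGCCGAT GGCGGCCGTC GCCGGGCCGA CGGCCCGGCC"
last_line = "ACGACCAACA CCGGCGGCGA GCCCGGAAGG TTGGTGATCC TCGGCCGGAA CGACTGA"

head = join_blocks(first_line)
tail = join_blocks(last_line)

print(len(head), head[:3])   # prints "60 GTG"
print(len(tail), tail[-3:])  # prints "57 TGA"
```

With 48 full lines of 60 bp followed by a final line of 57 bp, the joined gene sequence comes to 2937 bp, matching the Gene Length field.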