Gene Ndas_0542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0542 
Symbol 
ID9244383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp666221 
End bp668647 
Gene Length2427 bp 
Protein Length808 aa 
Translation table11 
GC content74% 
IMG OID 
Productglycosyl transferase group 1 
Protein accessionYP_003678495 
Protein GI297559521 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGATCG CCTATCTGAT CAACGACATG TACGGCATCG GCGGAACCGT GCGCACCGTC 
GCCAACCAGG CGGCGGCGCT GTCCGCCCGG CACGAGGTCG AGATCGTCTC GGTGTTCCGG
CACCGCGCCG AGCCCGTTCT GCCCGTACCC GCGCAGGTGC GCCTGCGTCC GCTCGTGGAC
GTGCGCGGGT ACGACGGCGA CCAGGGCACG GGGACGGACG GCCGCCCGCT GTACGAGGGA
TCGGAACGGG CCGGTGTACC GCCGCTGCTC TTCCCGCCGG AGGACGCGCG CGGTCCCACA
CACAGCCGTC TGACCGACGA CCGGCTGGAG GAGTACCTGG CCACCACCGA CGCCGACGTG
GTGGTGGGCA CCCGGCCTGG GCTGAACGTG GTGATCGCCC GGGCCGCTCC GAGCCGGGTG
GTCAAGATAG GCCAGGAGCA CCTGACCTAC GACCAGCACT CCGAGCCGCT GCGCCGGGTG
ATGGAGGAGA CCTACCCCCA TCTGGACGCC TTCGTGACGG TGACCGAGGC CGACGCGCGC
ACGTACCGGG AGCGGATGCC GATGCCGCGG GTGCGGGTGC TGTCCATCCC GAACTCGGTG
CCCGCGCCCG CGTTCACCGC CAACGGGCAC AACAGCCGCA CGATCGTGTC CGCCGGGCGG
CTGGCCCCCA GCAAGCGGCA CGACCTCCTG GTGCAGGCCT TCTCCATGGT CAGCGAGGAC
TTCCCCGAGT GGACGCTGCG GATCTACGGG CGCGGCAACC GGCGTGCGGC GCTGGCCCGG
CTGGTCAACT CGCTCGGGCT CCAGGACCGC GTGTGGCTGA TGGGCGCGCA CCCGCGCGTG
GAGGAGGCCT GGGCCCAGGG TTCGTTCGCG GCGGTGACCT CCAGTGAGGA GCCGTTCGGG
ATGACGATCG TGGAGGCCAT GCGCTCGGGG CTGCCGGTGG TGAGCACCGA CTGCCCGCAC
GGCCCGGCGT CGATCATCCG CGACCGCGAG GACGGGCTGC TGGTGCCCAA CCGGTCCGCC
CCGGGGATCG CCGACGGGCT GGCCCAGCTG ATGGCCGACG ACGAGCTGCG CCGGCGGATG
TCGTCCGCCG CGCTGACCGG GTCCGAGCGC TTCGACCCGG CCAACGTCAT GCTGATGCAC
GAGGAGCTGT TCTCCGACCT CTCCGGCCAC GAGCTGCCTC CGCCGCCGGT GTCGGTGCCC
GCGCCGCGCC CCGAACCGGC CGCCGCCTGC CGGGTGCGCG TGGACGGGGA CGGGGCCGCG
GTGCTGTCGT TCACCGACCC GCCCGACCGG GTGGTGCTGA CCCTGGAGGG CAAGCGGGTG
GAGCTGGAGC CGGACGGCGG CGACGTGGTG GTGGACGCCG CGCGCGGTCG GCTGGCCGCG
CGCACGTGGG AGGTGCTGCG GCTCGACGGC GACCGGGAGA CCCCGGTCCA CGACGTGGAG
ATCCGCCAGC ACGGCTTCCC GTTGGACCCG GAACGGGTCG GGCGCCTGTC CCTGGTGCTG
CCCTCCCGTT CAGCGAAGGG CCGCCTGGTG CTGCACGTGC GGCGCTCGGC GCACCACGCC
GAGGTGGAGC ACGTGGAGTC GGCCGACGGG CTGATCACCG TGCAGGGCCG GGTGCTGGGC
CGCGACGCGC CCGACGCCCA CGCGCGCCTG GCGGTGCGGC TGCGCACGTC TCCCGGGGCC
CTGCGGGAGT TCTCCGCGCC GGTGGCCGGT GACGGCCGTT TCACGGCGGT GGTGGACCCC
GAGGTCGTGG TGCACGGCCG GTACGGCGAA TCGGAGAAGT GGGAGCTGCG CCTGGTGCTC
TCCGACGGGA CCGAGTGCAC CGTGGGGCGT CACCTGTCCG GTGTGGCGGG CTACCAGAAG
ATCATCCGGT ACCCGGACCA GAAGGTGGAG CGGGGCGAGG GGTTCGGTTC GCGGGTGCGC
CCCTACTACA CGGTCAACGA CCGGCTGGGG CTCGTGGCGT CGCCGCTGGC TCCCACCCTG
GAGGTGGACC TCGTGCGGGC GAGCCCCGCG GGGCGTGGCG CGGGTCGCCT GCGGTTCGAG
GTGCGTCTGG CCTCCCCGCT CCCGGAGGGC GCGGAGTACG CCGTGGAGGT GGTGCGCGGC
ACCGACGTGG GCCGGCGCTT CCCGCTGCGG GTCGAGGGCG GCTCCGCCTC CGGCGGCCGT
CTGGTGGGCT CCCTGCCGCT GCTGACGGGG TCGGGGTACG GCGGCTCCGG CGCGATCCGG
GCCACGTGGC GGCTGCGGCT GCTGGCGGGG CAGCCGGGGG CTCTGGGGCG CCTGGGCGCC
ACGGCGGTCC CGGACAGCAC CTCGCGCCGC TGGAGCCGCG GCCCGTACGT GCGCTCGGCG
ACCGTGACGC CCCTGGGGTC CGGTGACCTC CAGGTCACCG TCGCCGACGT CCACGCGTGG
GAGGCCGTCC GGCGACGGAT GCCCTGA
 
Protein sequence
MKIAYLINDM YGIGGTVRTV ANQAAALSAR HEVEIVSVFR HRAEPVLPVP AQVRLRPLVD 
VRGYDGDQGT GTDGRPLYEG SERAGVPPLL FPPEDARGPT HSRLTDDRLE EYLATTDADV
VVGTRPGLNV VIARAAPSRV VKIGQEHLTY DQHSEPLRRV MEETYPHLDA FVTVTEADAR
TYRERMPMPR VRVLSIPNSV PAPAFTANGH NSRTIVSAGR LAPSKRHDLL VQAFSMVSED
FPEWTLRIYG RGNRRAALAR LVNSLGLQDR VWLMGAHPRV EEAWAQGSFA AVTSSEEPFG
MTIVEAMRSG LPVVSTDCPH GPASIIRDRE DGLLVPNRSA PGIADGLAQL MADDELRRRM
SSAALTGSER FDPANVMLMH EELFSDLSGH ELPPPPVSVP APRPEPAAAC RVRVDGDGAA
VLSFTDPPDR VVLTLEGKRV ELEPDGGDVV VDAARGRLAA RTWEVLRLDG DRETPVHDVE
IRQHGFPLDP ERVGRLSLVL PSRSAKGRLV LHVRRSAHHA EVEHVESADG LITVQGRVLG
RDAPDAHARL AVRLRTSPGA LREFSAPVAG DGRFTAVVDP EVVVHGRYGE SEKWELRLVL
SDGTECTVGR HLSGVAGYQK IIRYPDQKVE RGEGFGSRVR PYYTVNDRLG LVASPLAPTL
EVDLVRASPA GRGAGRLRFE VRLASPLPEG AEYAVEVVRG TDVGRRFPLR VEGGSASGGR
LVGSLPLLTG SGYGGSGAIR ATWRLRLLAG QPGALGRLGA TAVPDSTSRR WSRGPYVRSA
TVTPLGSGDL QVTVADVHAW EAVRRRMP