Gene Ndas_3981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3981 
Symbol 
ID9247852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4761688 
End bp4763256 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content74% 
IMG OID 
Productexopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 
Protein accessionYP_003681884 
Protein GI297562910 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCACAA CCAGCCGGAC GAAGACCGTC CACGAACAGT CGACGCGGAT CGTGGTCGGC 
CCCGGAGCCC CAGGGGCCCC AGGGGTCCCA GGAATCCCTG GGGCCGCGGG CGCCACCGGG
CGGCGCGCGG CGGTCGCCTC CGGCGGTGGG ACCAACGGCG CCTGGGTGCG CCCCTACGTC
GTGAGCCTGG TCGGCCTGGA CCTGGCCGCC GCCCTGACGG CCACCCTCAG CGGCGCGGCG
GTCCGCTTCC CCTCCGCTCT CGGAACCCTC ACCACCCTGC CCTACCTGGC CCTCTCCCTC
CTGCTGCCCC CGGTGTGGAT CCTCTTCGTC TACCTGGGCG GCGGCTACGC CCGCCGCTTC
CTGGGCGTGG GCACGGAGGA GTACCGCCGC GTCGCCACGG CCGGGATCGC CCTCGCCGCC
GCCGTGGCCG TCGGCGCCTA CGCCCTGCGC TTCGACCTCG CCCGCGGCTA CGCGCTGGTC
ACCCTGCCGC TCATCCTCCT GCTCACCCTC GGTCTGAGGT ACGCGCGCCG CAAGGCCCTG
CACCGGCGCC GCGGTTCGGG CCAGTGCATG AGCGGGGTCG TGGTCGTCGG CTACCGCGTG
GCCGTCCGCG ACCTGGTCCG CCGGTTCCGC GGCGAGGTCT ACCACGGCAT GCGCGTGGTG
GGCGTGTGCC TGCCCCAGGA GGAGGTCGCC TCCGGTCCGG GCGCCGACGA GGTGGAGGGC
TGTCCGGTCC TGGGCACCTT CACCGGCGCG GCCGAGGCCG CCGCCCTGGC CGGGGCCGAC
ACCGTCGCGG TCCTGGCCTG TCCGGAGATG GACGGGGCCG AGCTGCGCCG CCTGGCCTGG
CGGCTGGAGG AGACCGGCAC CGACCTCATC GTCGCCTCCG CGCTCATGGA CGTGGCCGGA
CCGCGCACCT CCATCCGGCC GGTCGCGGGG CTGCCCCTGC TGCACGTGGA GCACCCCGAA
CTGGTGGGCG CGCGCCGCGT CCTCAAGGGC GCCTTCGACC GCTGCGCCGC CGCCCTGGCC
CTGATACTGC TGTCGCCGCT GTTCCTCGCG CTGTGCGTCC TCGTCCGGGC CGAGGGCGGC
GGGCCCGCCC TCTTCACCCA GACGCGGGTG GGCAGGGGCG GCCGCGAGTT CACCGTCTAC
AAGTTCCGTA CGATGGTGGT GGGGGCCGAG GCGTTGAAGG CGATGCTCCA GCCCCGCAAC
GAGCACGAGG GCGTGCTGTT CAAGATGCGC CGCGACCCCA GGGTGACGGC CGTGGGGGCC
TGGCTGCGCC GGTACTCGCT CGACGAGCTT CCCCAGCTCG TCAACGTGGT CCGGGGGGAG
ATGTCGCTCG TCGGCCCGAG GCCGCCGCTT CCGGAGGAGG TCGCCCGCTA CGGGGACGAC
GTCCGCCGCA GGTTGGTGGT CAAGCCGGGT ATGACGGGTC TGTGGCAGGT GAGCGGCCGC
TCCGACCTCT CCTGGGAGGA ATCGGTCCGC CTCGACCTGC GGTACGTGGA AAACTGGTCG
CTGACACTGG ACGTCCAGAT CTTGTGGAAG ACGTGGTCAG CGGTGATCCG TGGGGCGGGA
GCATACTAG
 
Protein sequence
MVTTSRTKTV HEQSTRIVVG PGAPGAPGVP GIPGAAGATG RRAAVASGGG TNGAWVRPYV 
VSLVGLDLAA ALTATLSGAA VRFPSALGTL TTLPYLALSL LLPPVWILFV YLGGGYARRF
LGVGTEEYRR VATAGIALAA AVAVGAYALR FDLARGYALV TLPLILLLTL GLRYARRKAL
HRRRGSGQCM SGVVVVGYRV AVRDLVRRFR GEVYHGMRVV GVCLPQEEVA SGPGADEVEG
CPVLGTFTGA AEAAALAGAD TVAVLACPEM DGAELRRLAW RLEETGTDLI VASALMDVAG
PRTSIRPVAG LPLLHVEHPE LVGARRVLKG AFDRCAAALA LILLSPLFLA LCVLVRAEGG
GPALFTQTRV GRGGREFTVY KFRTMVVGAE ALKAMLQPRN EHEGVLFKMR RDPRVTAVGA
WLRRYSLDEL PQLVNVVRGE MSLVGPRPPL PEEVARYGDD VRRRLVVKPG MTGLWQVSGR
SDLSWEESVR LDLRYVENWS LTLDVQILWK TWSAVIRGAG AY