Gene Ndas_3973 details


Gene Information

Locus tag: Ndas_3973
Symbol:
ID: 9247844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4752067
End bp: 4753644
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: glycosyl transferase family 2
Protein accession: YP_003681876
Protein GI: 297562902
COG category:
COG ID:
TIGRFAM ID:
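
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of this database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the final stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4752067, 4753644)
print(length, protein_length_aa(length))  # prints: 1578 525
```

Both values agree with the Gene Length and Protein Length fields of this record.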


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0772213
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCGAAC AGACCACGCG TCCCGTCCCG CGCCCCGACG ACAGCGGCCA CCCCGACGTG 
GTCGTCAGCC TCGGCACCGA CCACCACTCC TTCGACCGGC TGGTCCGCTG GATCGACGAC
TACGCCCGGC GCCACCGGAC CCTGCGCTTC CTGGTCCAGC ACGGGCACAG CGCCGCCCCC
GAGGTGGCCG CGGGCACCCC GTTCCTGCCC GGCGAGGAGC TCGGCGAGCA CATGCGCCGG
GCCCGGGTGG TCGTCGCCCA CGGTGGACCG GGCACCATCG TCCAGGCCCG CCGCGCCGGA
CGCCTGCCCA TCGTCGTCGC CCGCGACCCC GAACTGGACG AGCACGTCGA CGAGCACCAG
CTCCTGTTCG TACGGCGTCT GGAGGAGGCG GGCCGGGTGC GTTCCTGCGC CACCCCCCAG
CAGCTCTGCG CGCTCCTGGA CAGGGCGCTC GCCTCGCCCG CGGACTTCCG GGTGGACCCC
GGCGACGGCG AGGGCACCGA GCGGGCCGCG CTGCGCGCCG GGGAGCTCAT CGACCTGCTC
ACGCGGGGCC GGGGCGCGAC GGCCGAACCC GTCGCCACGG CCGCCGCCCC CTCAGTCCCC
CGGGCCGCCG ACCACATCTT CGGCCGCACC GCCGCGCGGA CCGCCGCCTC CGGAGCCGAC
GACACCGGTC CCCTGCCCGG CGTGACCGTG GTGGTGCCCA CCCGGGACCG GCCGGAACTG
CTGCGCCGCA CCCTGCGGGC GATCAACGAG CAGGACTACT CCGGCCGCAT CACCACGATC
GTCGTCTTCG ACAACGACCA GCCCGACCCC TCACTGGCCC GCTCCGACGG CGACCGCCCC
GTGCGGGTGG TCACCAACAC CCTCACCCCC GGCCTGGCCG GGGCCCGCAA CACCGGTGTG
CTCGCCGCCG ACACCGACCT GGTGGCCTTC TGCGACGACG ACGACACGTG GCTGCCCGGG
AAGCTCCGGG CACAGGTCGG CGTCATGCTC GACGAGCCCG GCACGGAGAT GGTGTGCTGC
GGCATCCGGG TGGTCTACGA CAGGGTCGAG GCGGTCCGCA GCCTGGACCG CACCAGCGTG
ACCTTCGGCG ACCTGCTGGG GTCGCGCCTG ACCGAGCTGC ACCCGTCCAC GTTCCTCATC
CGGCGCCGCG CCATGATCGA CGGCTGCGGA ACCGTCAGCG AGGAGATCCC CGGCAGCTAC
GCCGAGGACT ACGAACTGCT GCTGCGCCTG GCCCGGCGCG GCCCCATCCG CAACATCCCC
GAACCGGGCG TGCGGGTGCT GTGGCACCGC AGGTCGCACT TCTCCGGGCG CTGGCGGACC
ATCTCCACCG CCCTGCGCTG GCTGCTGGAC CGCTACCCCG AGTTCGCTCT GGTGCCGCGC
GGCCACGCGC GCGTGGCCGG GCAGATCGCC TTCGCCGAGG CCGCCTCCGG CCGCCGACGC
GCGGCGCTGC GCTGGATCGG CACCACCGTC CGCAGCCGCC CGGCCGAGGC CCGCGCCTAC
CTGGCGCTGG CGGTGGTGCT CGGGGTGCCC GCCGGGTGGG TCACGCGCGC GCTGCATCTG
CGCGGCAGGG GCCTGTAA
 
Protein sequence
MTEQTTRPVP RPDDSGHPDV VVSLGTDHHS FDRLVRWIDD YARRHRTLRF LVQHGHSAAP 
EVAAGTPFLP GEELGEHMRR ARVVVAHGGP GTIVQARRAG RLPIVVARDP ELDEHVDEHQ
LLFVRRLEEA GRVRSCATPQ QLCALLDRAL ASPADFRVDP GDGEGTERAA LRAGELIDLL
TRGRGATAEP VATAAAPSVP RAADHIFGRT AARTAASGAD DTGPLPGVTV VVPTRDRPEL
LRRTLRAINE QDYSGRITTI VVFDNDQPDP SLARSDGDRP VRVVTNTLTP GLAGARNTGV
LAADTDLVAF CDDDDTWLPG KLRAQVGVML DEPGTEMVCC GIRVVYDRVE AVRSLDRTSV
TFGDLLGSRL TELHPSTFLI RRRAMIDGCG TVSEEIPGSY AEDYELLLRL ARRGPIRNIP
EPGVRVLWHR RSHFSGRWRT ISTALRWLLD RYPEFALVPR GHARVAGQIA FAEAASGRRR
AALRWIGTTV RSRPAEARAY LALAVVLGVP AGWVTRALHL RGRGL
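
The gene sequence begins with TTG rather than ATG, while the protein begins with M. This is consistent with translation table 11, where TTG can serve as an alternative start codon and is then read as methionine. The sketch below is a minimal illustration of that relationship and of how a GC fraction can be computed from the space-separated sequence blocks. The helper names and the reduced start-codon set (ATG, GTG, TTG only) are assumptions made for the example, not the method used to generate this record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino-acid assignments; it differs in start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiating codon is read as methionine regardless of its usual meaning.
        protein.append("M" if i == 0 and codon in START_CODONS else aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "TTGACCGAAC AGACCACGCG TCCCGTCCCG CGCCCCGACG ACAGCGGCCA CCCCGACGTG"
print(translate(first_line))  # prints: MTEQTTRPVPRPDDSGHPDV
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.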