Gene Ndas_0619 details

Gene Information

Locus tag: Ndas_0619
Symbol:
ID: 9244461
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 759467
End bp: 761122
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: glycosyl transferase family 2
Protein accession: YP_003678572
Protein GI: 297559598
COG category:
COG ID:
TIGRFAM ID:
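
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are both inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 759467
END_BP = 761122


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len, protein_length_aa(gene_len))  # prints: 1656 551
```

Both results agree with the Gene Length and Protein Length fields of the record.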


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCACGC TTGACATGAC TCCGGAACCG GTCTCCGGCG ACCGCGCGAC CGCGCCGATC 
CTGCCCCGCG CGGAGATCGA CTACCGAGAC CAGCCCCGCG CGCGGCGCAA CGACTACTCC
GTGCTCCAGC CGCCCGCGAT CGGCGAGTGG ACCCCGACCC TGTCGGTGTC GGTGGTCATC
CCCGCCCACG GGCACCAGGA AAAGCTCGAA CTCGTCCTGG CCTCCCTGTC CGCGCAGAGC
TACCCCGCGC ACCTGATGGA GGTGATCGTG GTGGACGACG GCACGCCCGA ACCCCTCACG
CTCCCACCGG TGCGCCCGGA GAACACCCGG CTGATCACCT CGGCCCCCGG CGGCTGGGGC
TCGGCGCACG CCGTCAACAG CGGCGTGGCC GTCTCCTCCG GCCAGGTGGT CCTGCGCCTG
GACGCGGACA TGCTGGTCTA CCGCGACCAC GTCGAGTCGC AGATGCGCTG GCACCACCTG
GTCGACTACG GGGTCGCCCT GGGCCACAAG ATGTTCGTGG ACTTCGACCC GAAGGCCATG
ACCGCCGAGT ACGTGGCCAC CGAGGTGCGC GAGGGACGCG CCGCCCAGCT GTTCGACCGC
GAGAGCGCCG ACCCGCACTG GGTGGAGCAG ACCATCGACG GCAAGGACAA GCTGCGGACC
GCCGACCGCC TGGCCTACAA GGTGTTCATC GGCGCCACCG GCTCCCTGCA CCGCACCCTG
TTCGACGCCG CGGGCGGCCT GAACGGGGAG CTGGTCCTGG GCGGCGACAG CGAGTTCGCC
TACCGGGTCT CCCAGCAGGG CGCCCTGTTC GTCCCCGACC TGGACACCAG CAGCTGGCAC
CTGGGCCGCA CCCAGATGCA GACACGCCGC GACGCGGGCA CCCGCTACCG CGCCCCCTTC
GTGGCCAACC GGGTGCCCGA CTTCCACCTG CGGCGCAAGC GCCCCGACCG GCAGTGGGAG
GTCCCGCTGG TCGACGTGGT GATGGACGTG GACGGCGCCA CCCTGGAGGA CGTGGACACC
ACCGCGTCCG CGCTGCTGTC CGGTACCACT CCCGACATCC GGCTGTGGCT GGTCGGCCCC
TGGGACGTGC TGGACGAGGG GCGGCGCTCG CCGCTGGACG AGGAGCGGCT CGACCTGCGG
CTGATCCGGG AGACCTTCCG GGGCGACCCC CGGGTGCGCC TGGTGGAGGA CGCGCCTGGC
CACGACCCGC TCGTCAACTT CCAGCTGCGT GTGCCCGCCG GTCCGGCCCT GAGGGAGCGG
GCGGTCGTCG AGCTGGTCGA CATGGCCAAC AAGAACAAGG CCGGGCTGCT GTGCTCTCCC
GTGCCCGGCG CCACACGCGG TGACGGCGTC ATGCGCCTGG AGCGGATCGC CGCCTACGCC
AGGGCCCGCC ACCTGTGGCC CGAGGCGACC TCCGAGGAGC TGGACCGCCG GGTGGAGGAG
GTCTACGGCA CCCACTGGGT CCCCGGGACG GACTTCGTGT TCCCGGAGGA GGGGGAGCGG
ACCAAGCCCG AGAACCCCGA GACGCTGCGG CGCAAGCTCG ACCAGGCGCT CGCCGAGGTG
GAGCGCATGC GTGCCCGGGC CAGGCGCGCC GAGCGCAAGC TGCGCTGGTT CACCCCGGGG
CTCACCCGCA GGGCGCTGCG CAAGCTGGCG CGCTGA
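
The record reports a GC content of 73% for this gene. How the database derived that figure is not stated. A minimal sketch of the usual definition (the count of G and C bases divided by the total base count) is shown below, assuming the sequence is pasted in the space-separated blocks of ten used above. The function name is illustrative.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, copied from the record.
first_row = "TTGACCACGC TTGACATGAC TCCGGAACCG GTCTCCGGCG ACCGCGCGAC CGCGCCGATC"
print(f"{gc_fraction(first_row):.1%}")
```

Passing the full 1656 bp sequence instead of the first row gives the figure to compare against the GC content field.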
 
Protein sequence
MTTLDMTPEP VSGDRATAPI LPRAEIDYRD QPRARRNDYS VLQPPAIGEW TPTLSVSVVI 
PAHGHQEKLE LVLASLSAQS YPAHLMEVIV VDDGTPEPLT LPPVRPENTR LITSAPGGWG
SAHAVNSGVA VSSGQVVLRL DADMLVYRDH VESQMRWHHL VDYGVALGHK MFVDFDPKAM
TAEYVATEVR EGRAAQLFDR ESADPHWVEQ TIDGKDKLRT ADRLAYKVFI GATGSLHRTL
FDAAGGLNGE LVLGGDSEFA YRVSQQGALF VPDLDTSSWH LGRTQMQTRR DAGTRYRAPF
VANRVPDFHL RRKRPDRQWE VPLVDVVMDV DGATLEDVDT TASALLSGTT PDIRLWLVGP
WDVLDEGRRS PLDEERLDLR LIRETFRGDP RVRLVEDAPG HDPLVNFQLR VPAGPALRER
AVVELVDMAN KNKAGLLCSP VPGATRGDGV MRLERIAAYA RARHLWPEAT SEELDRRVEE
VYGTHWVPGT DFVFPEEGER TKPENPETLR RKLDQALAEV ERMRARARRA ERKLRWFTPG
LTRRALRKLA R
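
The gene begins with TTG, yet the protein begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), which lists TTG among its alternative initiation codons; an initiation codon is read as methionine. The sketch below translates a CDS under that assumption. The amino-acid string is the standard codon assignment in TCAG order, and the set of start codons follows the NCBI definition of table 11. The function name is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Codon assignments of NCBI table 11 (same as the standard code), TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGACCACGC TTGACATGAC TCCGGAACCG"))  # prints: MTTLDMTPEP
```

The output matches the first ten residues of the protein sequence in the record.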