Gene Ndas_3414 details


Gene Information

Locus tag: Ndas_3414
Symbol:
ID: 9247281
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4081988
End bp: 4084021
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: 4-alpha-glucanotransferase
Protein accession: YP_003681325
Protein GI: 297562351
COG category:
COG ID:
TIGRFAM ID:
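
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4081988
end_bp = 4084021

# Assumption: coordinates are 1-based and inclusive, so the length
# is the difference plus one.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2034
print(protein_length)  # 677
```

Both results agree with the Gene Length (2034 bp) and Protein Length (677 aa) fields.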


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.723218
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGGACG TTCAGCTGGC ACGGTTGGCC GAGCAGCACG GAGTCGCGAC GACCTACGAG 
GACTGGCGGG GGCGCAGGGT CCCGGTCAGC CCCGACACCA TCCGCCATGT CCTGGCCGCC
CTGGGCGTGG ACCCGGCGGC CCCGGACGCG GGGGCGGCGC CGGGCCCGCT GCCGCCCGCG
GTCGTCGTCC GCGAGGGCAA GGCGCCCGAC CTCGTGCTCG CCGAGGGGAC GCACAGCGTC
GTCGAGACCT GCGGCGGCTC CCTGCTGCCC TTCGGCCCCG GGCTGCCGCT GGGCGTGCAC
ACCCTGCGCG TCACCGACGG CGCCGCCGAG CACACCGCGC CCCTGCTCGT GGCCCCCGAC
CGCATCGAAC CCAGCGCGCT CAGGGACCAC CCGCGCCTGT GGGGGATCAT GGCGCAGGTC
TACAGCGTGC GCTCCCGCGC CTCCTGGGGC ACCGGCGACC TGCGCGACCT CGCCGAGCTC
GCCGACTGGA GCGCGCGCGA CCTGGGCGCC GACTTCGCCC TGATCAACCC GGTGCACGCC
ACCGAGCCGC TGCCCCCGGT GGAGCCCTCG CCCTACCTGC CGGTCAGCCG CCGCTACGCC
AGCCCCCTCT ACATCCGCAT CGAGGACGTG CCCGAGTACG CCCGGCTCGA CCCCGCCCAG
CGCGAGGGCA TCCAGCGCCT GGCCCGGCCG CTGCGCGAAC GCGGCCGCAC CGCCGACCTC
CTCGACCGCG ACTCCGTGTG GAGGGCCAAG CGCGCCGCCC TGGAGGTCCT CTTCGAGTCG
CCCCGCGCCC CCGGCCGCGA GGCCTCCCTG CGCGCCTACC TCGAACGCGA GGGCCGACCG
CTGGTCGAGT TCGCCACCTG GTCGGCCCTG GCCGAGCGGC ACGGCACCGA CTACCGGACC
TGGCCGCAGG GGCTGCGCGG GGTCCACAGC CGCGACGTCG GGGAGGAGGC CCTGCGCCGG
TGGCCCGCCG TGGAGTTCCA CCGCTGGCTC CAGTGGATCC TCGACGAGCA GCTCTCCCAG
GCGCAGACCA CCGCCGAGAC CGGCGGGATG GCCGTCGGCA TCGTGCACGA CCTGGCCATC
GGGGTGCAGG CGGGCGGCGC GGACGCGTGG ATGTACGGGG ACGCCCTCGT GCCCGGCCTG
CACGTGGGGG CGCCCCCGGA CGAGTTCAAC CGGCAGGGGC AGGACTGGGG CCAGCCGCCC
TGGCACCCCC GGCGCCTGGC CGGGCGCGGC TACGAACCGT TCCGCCAGAT CCTGGCCCAG
GCGTTCCGCT ACGCGGGCGG GGTCCGCATC GACCACGTCA TGGGCATGTT CCGGCTCTGG
TGCGTGCCCG AGGGCGCCTC CGCGGCCGAG GGCACCTACG TCAGGTTCGA CCACGAGGGC
ATGGTGGGCA CGCTCGCCCT GGCCGCCCAC CGGGCGGACG CCGTCGTGGT CGGGGAGGAC
CTGGGCACCG TGGAGCCGTG GGTGCGCGAA CACCTCGCCG ACCGCGGGAT CCTCGGTACC
TCCGTGCTGT GGTTCGAGCA GGACGCCGAC GGCCGCCCGC GCCCGCCGGA GGAGTGGCGC
ACCGACTGCC TGGCCACGGT CGCCACCCAC GACCTGCCGC CGGTGGCGTC GTTCCTGTCC
ACCGAGCACG TCGAGCTGCG CGACCGGCTG GGCCTGCTCG GCCGCCCCGT GGAGGAGGAG
CGCGCCGACG CCGAGTCGCG GGTGGAACGC TGGCGCGAGC TCCTCGTGGA GCTGGGCCTG
CTCGACGAGG ACGTGGACCC CGTGGCCGAC TCGGCCGCCG TGGTGGCGGC CATGCACGCC
TACCTGGTGG CCACCCCCGC CCGCATGATC GGGGTCGCGC TCACCGACGT GGTGGGGGAG
CGGCGCATGC AGAACCAGCC CGGCACCAGC GACGAGTACC CGAACTGGCG CATCCCGCTC
ACCAGCGGCG AGGGAGAGCC GGTCCTGGTG GACGAACTGC TCGCCGACCC GCGCATGGTG
GCGCGCATCC GCCGCACCCT CGGGCCTGTC GGCCGGCGGG ACCACGCTCC CTAA
 
Protein sequence
MRDVQLARLA EQHGVATTYE DWRGRRVPVS PDTIRHVLAA LGVDPAAPDA GAAPGPLPPA 
VVVREGKAPD LVLAEGTHSV VETCGGSLLP FGPGLPLGVH TLRVTDGAAE HTAPLLVAPD
RIEPSALRDH PRLWGIMAQV YSVRSRASWG TGDLRDLAEL ADWSARDLGA DFALINPVHA
TEPLPPVEPS PYLPVSRRYA SPLYIRIEDV PEYARLDPAQ REGIQRLARP LRERGRTADL
LDRDSVWRAK RAALEVLFES PRAPGREASL RAYLEREGRP LVEFATWSAL AERHGTDYRT
WPQGLRGVHS RDVGEEALRR WPAVEFHRWL QWILDEQLSQ AQTTAETGGM AVGIVHDLAI
GVQAGGADAW MYGDALVPGL HVGAPPDEFN RQGQDWGQPP WHPRRLAGRG YEPFRQILAQ
AFRYAGGVRI DHVMGMFRLW CVPEGASAAE GTYVRFDHEG MVGTLALAAH RADAVVVGED
LGTVEPWVRE HLADRGILGT SVLWFEQDAD GRPRPPEEWR TDCLATVATH DLPPVASFLS
TEHVELRDRL GLLGRPVEEE RADAESRVER WRELLVELGL LDEDVDPVAD SAAVVAAMHA
YLVATPARMI GVALTDVVGE RRMQNQPGTS DEYPNWRIPL TSGEGEPVLV DELLADPRMV
ARIRRTLGPV GRRDHAP
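
The gene and protein sequences above can be compared with a short script. This is a minimal sketch, not the tool that produced this record. It assumes that translation table 11 shares its codon-to-amino-acid assignments with the standard genetic code, and that an initial ATG, GTG, or TTG codon is read as methionine (table 11 allows further start codons, which are left out here for brevity). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only these start codons are treated as initiator methionine.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence shown above.
first_block = "GTGAGGGACG TTCAGCTGGC ACGGTTGGCC GAGCAGCACG GAGTCGCGAC GACCTACGAG"
print(translate(first_block))  # MRDVQLARLAEQHGVATTYE
```

The printed residues match the first 20 residues of the protein sequence (MRDVQLARLA EQHGVATTYE). Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.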