Gene Ndas_5534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5534 
Symbol 
ID9249437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp725990 
End bp728251 
Gene Length2262 bp 
Protein Length753 aa 
Translation table11 
GC content78% 
IMG OID 
Producttransglutaminase domain protein 
Protein accessionYP_003683419 
Protein GI297564446 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0283272 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCACGC TGACACCGCC ACCCCGCGCC GAGGAGGCTC CGGTCGCTCC CGCGCCGCCG 
GACCCGCCCC CGGAGGGCAT CCCCGTTGCC GCGGGGCTGG TCATGGCCGC CGCCGCGGGC
GGTATCGCCG TCGCCCCCGG ATACGCCGAC CCCGCCCCGG TCCACGTCCT GCTACCCGCC
TGCGCCCTGC TCAGCGTCGT GGTGACCCTG CTGCTGCGGC GCCGGACGAG CGCGCTCCGT
ACCGCCCTGG CCGGGCTGCC GCTCCCGCTG GCGGTGGTGG CGGCGTGCTC GGTGTGGGTG
CCCGGAGAGG GGACCGGGGT GCTCGGCTCC ACCGTGGAGG CGGTCCTGCA CTCGGGGGCG
CGGATCCTCA CCAGCGCCGC GCCGACCCCG CTGAACGCGG ACACCCTCGC CCTGCCGGTA
CTGGCCACCT GGCTGACCGG CGCCGCCTCC GCTCTGGCCT GGCGCGGACG CCGCCGGGCG
CTCGCCCTGC TGCCGGGGCT GCTGCTGCTC GTCGGCGCGG TCGTGCTCAA CGGTCCCGTC
GCCCCGCCCG GCTTCCCCGC CATCGGACTG CTCACGGTGT CCTCGGTCAT CCTCATGTCC
GCGTCCCGGG AGGAGCACGA CCGCGGCGCC CCGGCGGGCC GGACGGCGCT GAGCGTCGAG
GTCGACACCC CCGGCGGCAC CCCGCCGGGA AGACCGCGCC GGGTACTGGT GACCGGGGTC
ATCGCCGTGC TGACCGCCAC GGCCACCGTG TACGGCGGAC CCGTCCTGCT CGCCGGATGG
GACGCGGAGC CGGGCGACCC CCGCACGGTC GTGAGCCCGC CGATGGACCC CCAGGCGGCG
CTCAACCCGC TCGTCTACCT GTCCGGCTGG GCCGCCGACC CCGACGAGCC GCTGCTCACC
GTCGCCGCCG ACGAACCCGT CAGCCTGCGC TGGGTCACCC TCTCCGACTT CACCGGCACC
ACCTGGCTGC CCGAGGGCGG CTACCGGGCG GCGGGTCAGG AGCTTCCCGA GCCCGTGCCG
CCCCTGCCCC ACGCCACCGG GGTCAGCGCC GAGATCACCG TCGGGGAGGA CCTGCCCGGC
AGCTGGGCCC CGGTCGTCGG CGCCCCCCGG CGGATCGGCC TTCCCTCCCC GGGCTACAGC
GCGCTCTCGG GAACCGTGGT GAACATGGAC GGCGCCGTCG CGGGCGCCCG GTACCAGGTC
ACCGGCGACG TCGCCGACTG GCGTCCGGGC GAGCTGGCCG GGGCCTCCAC GCCCGCGGAC
GAGATCTTCG ACCGCTACCG CGAACTGCCC GCCGGGGCGC CGTCCGTGCT CAACGACGTC
GTCGCCGCGG TCGCCAGCGA GGGTTCCCCG TACCAGCGCG CCAGCGCCCT GGCGGAGTAT
CTGAGGCGGT CGCACCGGTT CGACCCCGAG ACCCCCGGCG GCCACGGCTA CGCCAACGTC
GCCGCGGTCC TCGCCCCGCC CGGGGCGGAG GGGGGCGCGG GCACCTCCGA GCAGTTCGCG
AGCGCCTTCG CCATGCTGGC CCGCGCGGCC GGACTGCCCA GCCGCGTGGC CGTCGGCTTC
GGGGCGGGCA CGGAGGACGC CGGCGGTACC CGCACCGTCC GCACGGGCGA CGCCGTGGCC
TGGGGCGAGG TCTACTTCGA CGGCGTCGGA TGGGTGCCGT TCGCCGTCAC CCCCGGGGAG
GAGGGCGGGG ACGACGCGAG CGGGTCCGCG CGGAGCGAGC GGACCCCGGA GGGAGCCGAT
TCCGCGCAGG ACCAGGTCCT GCCCGAGGAC GACGCCCACG ACGTCGCCGC GCCCCGGCGC
GAGCCGGAGC GGTTTCCGGT GTGGGGGGTG CCCGCTATCG CGGGCGGTCT GCTCGCCGCC
GTGCTCGCCG TCCCCGCGCT GCGCCTGGCC CGCAGCGGGC GGCGGCTGCG CGCGGGCACG
CCCGAGCGCC GCGTCCTGGG CGCGTGGCAC GAGCTGCGCG ACGGCCTGCG CCTGTGCGCG
TCGGAGCCGC CGCCGGGGCA CACGGTCAGC GACACCGTCG CGCTCGCCCG GAGCCTGCTG
CCCGAGACCG CCCAGGCCTG GGCCCACGTG GACCTGGGCA GGCTGGGCCG CACGGTCAAC
GGGATCGGGT TCGCCGCGGG GCCGGGCGTC ACCGGGGAGC AGGCGGCCTC CATCGCCGAG
GGCGTGCGCC GCCAGCTGCG CGCGCTGCGT CTCAGCAGGT CCCGTACCAG GAGGCTCACC
TGGTGGTTCG ACCCCCGTCC GCTGCTGTGG CGGGAGCGGT GA
 
Protein sequence
MVTLTPPPRA EEAPVAPAPP DPPPEGIPVA AGLVMAAAAG GIAVAPGYAD PAPVHVLLPA 
CALLSVVVTL LLRRRTSALR TALAGLPLPL AVVAACSVWV PGEGTGVLGS TVEAVLHSGA
RILTSAAPTP LNADTLALPV LATWLTGAAS ALAWRGRRRA LALLPGLLLL VGAVVLNGPV
APPGFPAIGL LTVSSVILMS ASREEHDRGA PAGRTALSVE VDTPGGTPPG RPRRVLVTGV
IAVLTATATV YGGPVLLAGW DAEPGDPRTV VSPPMDPQAA LNPLVYLSGW AADPDEPLLT
VAADEPVSLR WVTLSDFTGT TWLPEGGYRA AGQELPEPVP PLPHATGVSA EITVGEDLPG
SWAPVVGAPR RIGLPSPGYS ALSGTVVNMD GAVAGARYQV TGDVADWRPG ELAGASTPAD
EIFDRYRELP AGAPSVLNDV VAAVASEGSP YQRASALAEY LRRSHRFDPE TPGGHGYANV
AAVLAPPGAE GGAGTSEQFA SAFAMLARAA GLPSRVAVGF GAGTEDAGGT RTVRTGDAVA
WGEVYFDGVG WVPFAVTPGE EGGDDASGSA RSERTPEGAD SAQDQVLPED DAHDVAAPRR
EPERFPVWGV PAIAGGLLAA VLAVPALRLA RSGRRLRAGT PERRVLGAWH ELRDGLRLCA
SEPPPGHTVS DTVALARSLL PETAQAWAHV DLGRLGRTVN GIGFAAGPGV TGEQAASIAE
GVRRQLRALR LSRSRTRRLT WWFDPRPLLW RER