Gene Ndas_0884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0884 
Symbol 
ID9244729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1081028 
End bp1083430 
Gene Length2403 bp 
Protein Length800 aa 
Translation table11 
GC content76% 
IMG OID 
Producttransglutaminase domain protein 
Protein accessionYP_003678834 
Protein GI297559860 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCTGG CCACCCTGGC CTGCCTGTTG ATGGCGATGC CACTGCTGGA CGGCCTCGTA 
CGGGGCGAGG CCTGGTGGGT CCCGGCCCTC GTGATGACGG TGGCGGTGGG CGGGGTGTCC
GCGCTGTACC GCCTCAGCGG CTGGAGCTCC TTCCCGGTGC CGTTCCTCCA GGTGCTGGCG
GCGGCGCTGC TGATGACGCC GCTGTTCGCC GGACACGTCG CGCCGCTGGG GCTGCTGCCC
TCCGGTGACA CGGCGCTGCA CCTGCTGCGG GTGTTCGACG AGGGCCTGGA GACGATCGAC
ACCAGCACGC CCCCGGTCTC CTCCACCGCC GGGGTCATGC TGATCATCGC GCTGGTGTTC
GTCCTGTTCG CGATCATCGC GGACTTCCTG GCGGTGACCG CGCGCTGCCC CGGGATGGTC
GGCGCCCTGG TGGCCGTGCT GATGGCGGTC CCGCTCATCG TGGACGACGC CGGACTGGGC
TGGCCGGCCG CGACCAGCGG CGCCGTCGGC TTCCTGCTGC TGCTCGCGGT GGACGTGTGG
GTGCGCGGCC GCGAGTGGGG CGTGCGCGTC CCCGACGGCC ACGACTCCTC GGCCCGTGTG
CTGGGCGCGG TGGGCAGGGC GACGACGGTG GCGGTGGCCT CGGCCGCCGC GGTGCTGCTG
GCGCTGACGG TGCCGCTCGC GGTGCCGTCG CTGCGCACCG ACGTGCTCCA CACGATGGCC
GACGGCACCT ACATCGGCAC CGGCGGGGAC CGCATCACCA CCACGCACCC GCTGGTCTCG
CTGCGGCGCG AGCTGGCCTC CTCCTCCGAC CGCACCGTGC TGACCTACCG GACCGACGCC
GAGCGGCCCG ACTACCTGCG CACGTACGTG CTGGACGAGT TCGACGGCGC GAACTGGACG
ATGACGCCGG TCAGCGCCTC GGGCGACACC CTGGTCGACG GGGAGCTGCC GCTGCCCACC
GGGTGGGGCT CGGAGCCCTC AGGCGACGCG GTGACCACGC GGATCTCGAT GGACTCGGAC
GCGCCCGGGA TCGACTTCCT GCCGTTGCCG TACTGGGCGC GGACGGTGGA CGTGCCCGGG
GAGTGGTACG CCGACCCGGC GAGCCACGTG GTGTTCACGA CCGAGAGCGC GCCCACGGGG
CTGAACTTCA CCGTGCAGAC CCTGGGCCGG GAGCCGTCCG CCGAGGAGCT GGCCGACAGC
GGCAGCCCGC GGTCGCTGCC GGGCGACTTC CGCGACCTGC CCGGCGACCT GGACCCGCGG
GTCCGGGAGC TGGCCGAGGA CCTGACCGAG GACGCGCGGA GCCCCTACGA GCGGGCGGTG
GCCATCCAGG ACCACTTCAC GGGGGGCGCG TTCACCTACG ACCTGTCCCC GCCCGCGGTG
CCCGACGGCG CCGACCCGCT CGCCCACTTC CTGTTCGGGG ACCGGGTGGG CTACTGCGAA
CAGTTCGCCG GGGCGATGGC GGTCATGGCC CGACAGGTGG ACATCCCGGC GCGGGTGGCG
GTGGGCTACA CCGCGGGCGA GCAGCAGGGC GACGGCCGCT GGGCGGTGTC CGTGGGCGAC
GCCCACGCCT GGCCGGAGCT GTACTTCGAG GGCGCGGGCT GGGTGCGGTT CGAGCCGACC
CCCTCCTCCG CCGGGGGGCA GGGCTCGGCG TCGGTGCCCG ACTACTCCCG GGGCGGGCAG
GGGGGCGTCG ACGACCCGGA CGGGGCCGCG GAGGAGACGC GGGAGCCCTC GCCCGAGGAG
ACGGAGGCCG CGCGGGACGA GGCCACCGCG GAGGCCGAGG AGCCCAGCGG GACCCCCGAG
GCGGAGCCCG AGGACGAGCC CTCCACGTCG GTGGCCGCCC CGGGCGACGA CGCGCGGGGC
GGCCCGGACC TGTCGTGGCT GCCCGCGGCG GGCGCCGCCA CGGGTGTGCT GCTGCTGTTG
GCCCTGCCCG CGCTGGTGCG CGCCCTGACG CGCTGGTCCC GGACGGCGTC GCTGACCGGC
GGCGCGGGAG CTGCGGGCGC GCACACCGCG TGGCGGGAGC TGCGGGACAC CTGTCTGGAC
CTGGGCGGCG CGTGGTCGCT GGCGGAGAGC CCGCGTGCCA CGGCCGAACG CCTGGCCGGT
TCCGGTCCGG TGCCCCAGCC CGACGCGGCC CCGGCGGGGC TGATGTCGGG GACCGTTCCG
GGCCCGGTGC CGCCGGAGGC CGCGGCGGCG CTGCGGCGGC TCGCCCTGGC CGAGGAGGAG
TCGCGCTACG CGCCCTCGCC CCGGACCCCG GAAGGGCTGC GCGAGGACCT GCGGACGGCT
CTGGCGGGGC TGACCGGTGT GGTCGGCGCG GGAACGCGCG TCCGCGCGGT GCTGCTGCCC
CGGTCGCTGG CGCCGTGGCA CCGTCCGCGC CGCGCGGCCG ACCCCGAGCC CGTGACGCCC
TAG
 
Protein sequence
MPLATLACLL MAMPLLDGLV RGEAWWVPAL VMTVAVGGVS ALYRLSGWSS FPVPFLQVLA 
AALLMTPLFA GHVAPLGLLP SGDTALHLLR VFDEGLETID TSTPPVSSTA GVMLIIALVF
VLFAIIADFL AVTARCPGMV GALVAVLMAV PLIVDDAGLG WPAATSGAVG FLLLLAVDVW
VRGREWGVRV PDGHDSSARV LGAVGRATTV AVASAAAVLL ALTVPLAVPS LRTDVLHTMA
DGTYIGTGGD RITTTHPLVS LRRELASSSD RTVLTYRTDA ERPDYLRTYV LDEFDGANWT
MTPVSASGDT LVDGELPLPT GWGSEPSGDA VTTRISMDSD APGIDFLPLP YWARTVDVPG
EWYADPASHV VFTTESAPTG LNFTVQTLGR EPSAEELADS GSPRSLPGDF RDLPGDLDPR
VRELAEDLTE DARSPYERAV AIQDHFTGGA FTYDLSPPAV PDGADPLAHF LFGDRVGYCE
QFAGAMAVMA RQVDIPARVA VGYTAGEQQG DGRWAVSVGD AHAWPELYFE GAGWVRFEPT
PSSAGGQGSA SVPDYSRGGQ GGVDDPDGAA EETREPSPEE TEAARDEATA EAEEPSGTPE
AEPEDEPSTS VAAPGDDARG GPDLSWLPAA GAATGVLLLL ALPALVRALT RWSRTASLTG
GAGAAGAHTA WRELRDTCLD LGGAWSLAES PRATAERLAG SGPVPQPDAA PAGLMSGTVP
GPVPPEAAAA LRRLALAEEE SRYAPSPRTP EGLREDLRTA LAGLTGVVGA GTRVRAVLLP
RSLAPWHRPR RAADPEPVTP