Gene Ndas_4880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4880 
Symbol 
ID9248767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp11818 
End bp13386 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content77% 
IMG OID 
Productputative transcriptional regulator, PucR family 
Protein accessionYP_003682769 
Protein GI297563796 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.750606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCACC TCGACCGTCT CGTCAACGTC CTGGGCGGCT ACGGCGCCCG GCTCGTCGGC 
CCCGAGCCGC TGCGCCGCAG GGAGCTGCGC AGCGTCGCCA TGCACGATCC CACCGACGAC
GTGCCGCAGC TGGGCGACGC CTTCCTCGCG GTCGGCGTGG ACTCGGCCGC CGAGGCGGTG
CGCCTGGCCG AGCAGGCCCG CGCGACCGTC GTCATCATCC GGTCCGCGAC CGAACCCGGC
GACGACGTAC GCGCACGCCT GGGCGAGGGC GGCCCCGCCC TCCTGGTCGT GGAGCCCGCC
GTGTCGTGGA GCCAGGTCTC CGGGGTCGTC TACGGGCTGG TCCTGGAGGG GCGCGAGACC
GAGGCCGGGC GCGGCCCCAG CGACCTGTTC ACCCTGGCCG ACACCGTCGC CGCCGAGGTC
GGCGGACCCG TCACCATCGA GGACCGCGCC TCGCGCGTGG TGGCCTACTC GGCCTCCCAG
GGCGGCACCG ACCGGGTACG GCGCGACACC CTCGTCGACC GGCGGGTCCC CGAGCGGGTC
CGCGAACGCC TCGACGCCGA CGGGGTGCTC ACCCGGCTGG CCGCCGCGAC CGGCCCCCGG
TTCGTACCGG GGATACCCGA ACTCGACATG GGCGGACGCA CCGCCGCGCC GATCCGGGTG
GGCCGCGAAC TCCTGGGCAC GCTGTGGGTG GCGTGCGACG CGCCCCTGGA CGCGGACCGC
TCCCGGGCGC TGAGCGAGGG GGCGCACACG GTCGGGCTGC ACATGCTGCG CGCCAGGGTC
AGCAGCGACC TGGAGCGGCA GGTGGAGTCG GAGTCGGTGA TCGACCTGCT GGAGGGCTCG
GCCGACCCGG CGCAGACCGC CGGGCGGATC GGCCTGCTCG GGGCGGGGCT GCGGGTGATC
GCCTTCCAGG CACGCGCCCG CACCGAGCCC GAGGCCGCGA TCCTGCACCT GTTCGAGCAG
GTCACCACCG GTTTCGGCTG GTCGCGTCCA GGACGCAGCA CCCTGCTCGG AACCACCGTG
TACACCGTGC TGCCGTGCGG GGACGACCCC GCTCCGGCGG TGGAGTGGGT GCGCTCGACG
CTGCGCGGCC TGCCCGACCG GCTGGGCGTG GTGGCCGGGG TGGGCGGCAC CGCCGACGCC
GCGGGCCTGC CCGCCAGCAG GCAGGAGGCG GACGAGTGCC TGACGCTGCA CGGACAGGCG
CCCGAGGGCG CCGACGCGGT CGTCTACGAC GACGCCTGGG ACTCCGTGCT GCTGCGCCGC
CTGCGCCTGG TGGCCGAGGC CGGGCGCATG CCCGTCCGCA ACCCGGTGGC GGACCTGGTC
CGGTACGACG CCGAGCACGG CACCGACCAC GCGCGCACGC TCCGCGCCTG GCTGTACGTG
CACGGGGACC TGGGGAGGGC GGCGGAGCTG CTGGGCCTGC ACCCCAACAC GGTGCGGTAC
CGGCTGCGCC GGATGGGCCG GGTCGCGGAC CTGCCGCTGC ACGACCCGCG GGCCCGGGTG
GCGATGACGG TCGCGCTCGC CGCGCTCGTG GCGGACCCGC CCGGGGAGGA GGGGCCTCAC
CCGCTCTGA
 
Protein sequence
MIHLDRLVNV LGGYGARLVG PEPLRRRELR SVAMHDPTDD VPQLGDAFLA VGVDSAAEAV 
RLAEQARATV VIIRSATEPG DDVRARLGEG GPALLVVEPA VSWSQVSGVV YGLVLEGRET
EAGRGPSDLF TLADTVAAEV GGPVTIEDRA SRVVAYSASQ GGTDRVRRDT LVDRRVPERV
RERLDADGVL TRLAAATGPR FVPGIPELDM GGRTAAPIRV GRELLGTLWV ACDAPLDADR
SRALSEGAHT VGLHMLRARV SSDLERQVES ESVIDLLEGS ADPAQTAGRI GLLGAGLRVI
AFQARARTEP EAAILHLFEQ VTTGFGWSRP GRSTLLGTTV YTVLPCGDDP APAVEWVRST
LRGLPDRLGV VAGVGGTADA AGLPASRQEA DECLTLHGQA PEGADAVVYD DAWDSVLLRR
LRLVAEAGRM PVRNPVADLV RYDAEHGTDH ARTLRAWLYV HGDLGRAAEL LGLHPNTVRY
RLRRMGRVAD LPLHDPRARV AMTVALAALV ADPPGEEGPH PL