Gene Ndas_3497 details


Gene Information

Locus tag: Ndas_3497
Symbol:
ID: 9247366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4191569
End bp: 4192987
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: sigma54 specific transcriptional regulator, Fis family
Protein accession: YP_003681404
Protein GI: 297562430
COG category:
COG ID:
TIGRFAM ID:
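
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 4191569
end_bp = 4192987

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1419
print(protein_length)  # 472
```

Both results agree with the Gene Length and Protein Length fields of the record.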


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAGCCC CGCGGTTGCG CGCCTCGTGG CGGCGCAGCG AGCACTACGG CGTCTCCGCC 
GAACGGGTGG AGCCCGCGTT CACCGGTTCC GTGGACACGG AGTCGCTGTT CTACGAGTGC
GGCAACGAGG TCCTGCGGGG TATCCAGCGG ACCATCGCCA ACGAACCGGT GAGCCTGATG
ATCACCGACA GCGAGGGGCT GGTGCTGAGC AGGCTCGCCA ACGACGCGGC GATCCTGCAC
TCGCTGGACC GGGTCCACCT CGCCCCCGGG TTCTCCTACG CCGAACGCGA CGCGGGCACC
AACGGCCTCG GGCTGGCCCT GGCCGACCGC GCGCCGTCGC TGGTCAGGGC GGAGGAGCAC
TACTGCACGG GGCTGCGCGG GTACACCTGC GCCGCCGCTC CGGTCCTGGA CCCGTCCGAC
GGCACGCTGG TCGGCAGCGT CAACCTGACC ACCTGGTCGG AGTCCTCGTC GGCCCTGCTG
CTGGCGCTGG CCCAGTCCGC CGCGCAGAGC ACGTCCGCGC TCATGCTCGC CCGGGGCACG
GGACGCCGGG TCCAGCCCGC GCCCAGGGGC GCGGTGTTCC GCTTCCGCGC CGCCGGCGGC
GGACAGGCCG ACGCCTGCGC GTCGCGGCTC TGGCGCGGCG CCGTCGCCGA GGCCCGCGAG
GCGGTGGGCG GGCGGACGCT GGCGGTCGTG GGCGAGCCGG GTTCGGGCCG GACCTCCCTG
GCCTCGCTCG CGCGGCGGCA GGTCAGCGCA CGGGAGAGGG TGCTCAACGC CCGTCCGCCC
GCCCCGGAGG ACGTGGACTC CTGGCTCACG CTCTGGACCC CCGAGCTGGC CAAGGACGAC
ACGTGCGTGA TCGTGTCCGG GGTGGAGGCG CTTCCGGCGT GGGGCGCGAG CGAGCTGGCC
CGGCTGCTGG CCGGGGCGCG GCGCGCGGGT GGGCGCCCCC AGCCCTTCGT GGTCACGGGA
CGGAGCTTCG ACGCTCTCCC GGAGGCGCTG CGGGAGCTGG TGGACACGGT GGTCGAGGCC
CCCGCCCTGC GCCGCCGCCC GGAGGACGTG CTGCCGCTCG CGCGGCACTT CGCGCAGGGG
GCCCGGGGGC GGGCGATCGG CCTCACCCCG GCGGCCTCCC GCGCGCTCAC CGACTACCAC
TGGCCGGGCA ACGCCACCGA GCTGAAGCGG GCGGTGTGCG ACGCGGCGCA GCGCGCGGAC
GTGGTCGACG TGCACCACCT GCCCGCCGAG GTCTTCCGCA GCAACGGCGG GCGCCGCCTC
AGCCGTATCC AGGCCGTGGA ACGCGACGAG ATCGTCCGCT GCCTGACCGC TCCGGGCGCG
ACCGTCGTCG GGGCCGCCGC CGAGCTGGGG ATGGGACGGG CGACCGTCTA CCGCAAGATG
CGCCAGTACA ACATCCGGAT GCCCCACCAG CAGGAGTAG
 
Protein sequence
MKAPRLRASW RRSEHYGVSA ERVEPAFTGS VDTESLFYEC GNEVLRGIQR TIANEPVSLM 
ITDSEGLVLS RLANDAAILH SLDRVHLAPG FSYAERDAGT NGLGLALADR APSLVRAEEH
YCTGLRGYTC AAAPVLDPSD GTLVGSVNLT TWSESSSALL LALAQSAAQS TSALMLARGT
GRRVQPAPRG AVFRFRAAGG GQADACASRL WRGAVAEARE AVGGRTLAVV GEPGSGRTSL
ASLARRQVSA RERVLNARPP APEDVDSWLT LWTPELAKDD TCVIVSGVEA LPAWGASELA
RLLAGARRAG GRPQPFVVTG RSFDALPEAL RELVDTVVEA PALRRRPEDV LPLARHFAQG
ARGRAIGLTP AASRALTDYH WPGNATELKR AVCDAAQRAD VVDVHHLPAE VFRSNGGRRL
SRIQAVERDE IVRCLTAPGA TVVGAAAELG MGRATVYRKM RQYNIRMPHQ QE
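
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is not the database's own pipeline, just a minimal illustration under stated assumptions: codon assignments follow NCBI translation table 11 (the same as the standard code), an alternative start codon such as GTG at the first position is read as M, and translation stops at the first stop codon. The helper names `clean`, `gc_fraction` and `translate` are hypothetical. Only the first and last blocks of the gene sequence are used here; pasting the full sequence works the same way.

```python
from itertools import product

# Codon table in the NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a CDS; a table 11 start codon at position 0 becomes M."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_block = "GTGAAAGCCC CGCGGTTGCG CGCCTCGTGG"  # start of the gene sequence
last_block = "CAGGAGTAG"                          # end of the gene sequence

print(translate(first_block))  # MKAPRLRASW
print(translate(last_block))   # QE
```

The two outputs match the first ten and last two residues of the protein sequence above. Calling `gc_fraction` on the full gene sequence gives the value that the GC content field summarizes.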