Gene Ndas_2621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2621 
Symbol 
ID9246472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3125385 
End bp3126788 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content70% 
IMG OID 
Productputative transcriptional regulator, Crp/Fnr family 
Protein accessionYP_003680544 
Protein GI297561570 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.167094 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0008696 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCGATCG ACTCGGCTCC TGAGGTCCGG AGCAGCAAGC AGCAGAGCCT GGGTACCGCG 
GCCGCGCGGA ACCTGGCGAC CACCACCAAG TCCACGCCGC AGATGCAGGG CATCAGCTCC
CGCTGGCTGA CCCGGATGCT CCCGTGGGTG CACACCGACG GCGGCGCCTA CCGGGTCAAC
CGGCGGCTGA CCTACACCGT CGGCGACGGG CGGATCGAGT TCGAGCAGAC CGGCGCCCGG
GTCCGGGTCA TCCCCCACGA ACTCGGCGAG CTGGCCCTGC TGCGCGGCTT CGACGACGAG
GAGGTGCTGG CGGCCCTGGC CAACCGCTTC GTCCAGCGCG ACTTCGAGAA CGGCCAGGTC
CTGGTGGAGG AGGGCACGGC GGCCGACAGC CTGTTCCTGC TGGCGCACGG GCGCGTGCAC
AAGACCGGCA CCGGCCCCTA CGGCGAGGCC GTCCGGCTGG GCGTGCTGGC CGACGGCGAC
AGGTTCGGCG ACCAGCACCT GCTCGGCACC GAACCGGCGT GGGAGTACAC CGTCAAGGCC
GCGACGGCGG GCACGCTGCT GGAGCTGCCC CGCCGCGACT TCATCGCGAT CCTGGACGAC
TCGCCCGCCC TCCAGGCCCA CGTCCAGCAG TACCTGTCCC TGCCCGGCGA GCGGCAGAAC
AAGCACGGCG AGGCCGAGAT CGCCCTGTCC TCGGGCCACG TCGGCGAGGC CGAGCTGCCC
AGCACCTTCG TGGACTACGA GCTCAGGCCG CGCGAGTACG AGCTGAGCGT GGCCCAGACC
GTGCTGCGGG TGCACAGCCG GGTCGCCGAC CTCTACAACA AGCCGATGAA CCAGACCGAG
CAGCAGCTGC GGCTGACCAT CCAGGCCCTG CGCGAGCGCC AGGAGCACGA ACTGGTCAAC
AACCGCGAGT TCGGCCTGCT CCACAACGCC GAGTTCAAGC AGCGCATCCA GACCCACTCG
GGGCCGCCCA CCCCCGACGA CCTGGACGAC CTGCTGAGCA TGCGCCGCAA CACCCAGTAC
ATGTTCGCCC ACCCCCGCGC CATCGCCGCC TTCGGCAAGG AGTGCAACAG CCGGGGCCTG
AACATCGGCA CCGTCGAGGT CAACGGCCAC CACCTGCCCG CCTGGCGCGG GGTGCCCCTC
CTGCCCTGCG GCAAGATCCC GGTCACCGAG CACCAGACCT CCTCGATCAT CGCGGTCCGC
ACCGGCGAGG ACAACGAGGG CGTCATCGGC CTGTACCAGA CCGGCCTGCC CGACGAGGTC
GAGCCCGGCC TCAACGCGCG CTTCATGGGC ATCGACGACA AGGCCGTCAT CTCCTACCTC
GTCAGCACCT ACTACTCCGC CGCGGTGCTC GTCCCCGACG CCATCGGGAT CCTGGAGAAC
GCCGAGGTCC ACCCGCGTGG CTGA
 
Protein sequence
MSIDSAPEVR SSKQQSLGTA AARNLATTTK STPQMQGISS RWLTRMLPWV HTDGGAYRVN 
RRLTYTVGDG RIEFEQTGAR VRVIPHELGE LALLRGFDDE EVLAALANRF VQRDFENGQV
LVEEGTAADS LFLLAHGRVH KTGTGPYGEA VRLGVLADGD RFGDQHLLGT EPAWEYTVKA
ATAGTLLELP RRDFIAILDD SPALQAHVQQ YLSLPGERQN KHGEAEIALS SGHVGEAELP
STFVDYELRP REYELSVAQT VLRVHSRVAD LYNKPMNQTE QQLRLTIQAL RERQEHELVN
NREFGLLHNA EFKQRIQTHS GPPTPDDLDD LLSMRRNTQY MFAHPRAIAA FGKECNSRGL
NIGTVEVNGH HLPAWRGVPL LPCGKIPVTE HQTSSIIAVR TGEDNEGVIG LYQTGLPDEV
EPGLNARFMG IDDKAVISYL VSTYYSAAVL VPDAIGILEN AEVHPRG