Gene Ndas_0057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0057 
Symbol 
ID9243887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp74282 
End bp76021 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content75% 
IMG OID 
Producttranscriptional regulator, AraC family 
Protein accessionYP_003678015 
Protein GI297559041 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000194766 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCGAAA ACCGCCAGCT TCGGTGCGCC GCGTCGGCGA ACATGGATGG CGTGATGGAC 
GACGACCAGC GCTACCGCGC CGTACACAGC AGGGACGCCC GCTTCGACGG CGTCTTCTAC
ACAGCCGTGC GCACCACGGG TATCTACTGC CGTCCGAGCT GCCCGGCCGT CACCCCGAAA
CGGGCCAACG TCCGCTTCTA CCCGACCGCC GCGGCCGCCC AGGAGTCGGG GTTCCGCGCC
TGCAAGCGGT GCCGTCCCGA CCTCACCCCC GGTTCGCCGG AGTGGAACCT GCGGGCCGAC
GTGGTGGGCC GCGCGATGCG CCTCATCCAG GACGGCGCCG TGGACCGGGG CGGCGTCAGC
GCCCTGGCCT CGGCGGTCGG CTACAGCGAA CGCCAGCTCA ACCGGCTGCT GTCGGCCGAG
GTGGGCGCGG GGCCGCTGGC CCTGGCCCGG ACCGAGCGCG CCCAGACGGC ACGGGTGCTG
GTGGAGACCA CCGACATGCC GATGGCGGAC GTCGCCTTCG CCGCGGGGTT CGCGAGCGTA
CGCCAGTTCA ACGAGACGAT GCGCGCGGTG TTCGACCGCT CCCCCACCGA GATGCGGACC
ATGGGCGGTC GGCGTGCGTC GGCTTCCGGG GGGTCGCGTC CGGGGGACGC CCGCCCTCCG
GCCACCGAGC CGGGGACGGT CACGCTGCGG CTGCCCTACC GGGAGCCCAT CGACCTGGCC
CGGATGCTGA GGTTCCTCGG AGACCGTGCG GTTCCGGGCG TGGAGGAGTA CCGGGACGGG
GTCTACCGCA GGACGCTGAT GCTGGCGCAC GGTCCCGCCG TGGTGGAGTT GTCCGAGGGG
TCCGGGACCG GCAGGGCGGG CAGGACCGGC CGCGCTGGTG CCACGGGCGG CGTTCGTCCG
GCGGACGCTG TGGACGGCGG GGTGTCCGTG AGCGGTGGGG GGCACGTGCT GTGCCGCCTG
CGGTTGTCGG AGGCGCGCGA CCTGACCAGC GCGGTGCGCA GGTGCCGCAG GCTGCTCGAC
CTGGACGCCG ACCCGGGTGC GGTGGCCGAG GCCCTGGGCG GGGACCCCCT CCTGGGACCG
ATCGTGGCCG CCCACCCGGG ACTGCGATCG CCGGGGCACG TGGACCCGGC CGAACTGGCG
GTCCGGGCGG TCCTCGGCCA GCAGGTGTCG GTGCGTGCGG CCCGCACGCT GGCGGGGCGG
CTGGTCGAGC GGTTCGGCGA ACCGCTCGCT CCGGGCCTGG AAGCGCCGGG CGGAGGACTC
ACCCACGTGT TCCCCTCCCC TGACGCGCTC GCCGCGGCCG ACCCGGCCGG TTTCTCCGTC
CCGGTCGCGC GGGGACGCGC CCTGGCGGGG CTGTGCGAGG CGATCGCCTC GGGGTGGATC
GACCTGGGGC CGGGATGCGA CCGGGACGAG GCCGAACGGC GTCTGGTGGA GCTGCGCGGC
ATCGGTCCGT GGACCGCCGG TTACGTGCGC ATGCGGGGTC TGGGCGACCC GGACGTGTTC
CTGCACGGCG ACCTGGGCGT CCGGATGGCG CTGGAGGCGG GGGGCAGACG GGCGACCCCC
GCGGCGGCCG CGCGCGAGGC ACGGGAGTGG AGCCCGTGGC GGTCCTACGC CAACCACGCG
CTGTGGGCGT CGTTGGCCGA CCGTGAGCGG GAGAGCACGG CCGTTCGGGC GGACGTGGTC
GTGCGGGACG GCGTGCGGGA TGCTTCGAAG GAACGTCAGG AACGGAAGGA ATCGGCATGA
 
Protein sequence
MSENRQLRCA ASANMDGVMD DDQRYRAVHS RDARFDGVFY TAVRTTGIYC RPSCPAVTPK 
RANVRFYPTA AAAQESGFRA CKRCRPDLTP GSPEWNLRAD VVGRAMRLIQ DGAVDRGGVS
ALASAVGYSE RQLNRLLSAE VGAGPLALAR TERAQTARVL VETTDMPMAD VAFAAGFASV
RQFNETMRAV FDRSPTEMRT MGGRRASASG GSRPGDARPP ATEPGTVTLR LPYREPIDLA
RMLRFLGDRA VPGVEEYRDG VYRRTLMLAH GPAVVELSEG SGTGRAGRTG RAGATGGVRP
ADAVDGGVSV SGGGHVLCRL RLSEARDLTS AVRRCRRLLD LDADPGAVAE ALGGDPLLGP
IVAAHPGLRS PGHVDPAELA VRAVLGQQVS VRAARTLAGR LVERFGEPLA PGLEAPGGGL
THVFPSPDAL AAADPAGFSV PVARGRALAG LCEAIASGWI DLGPGCDRDE AERRLVELRG
IGPWTAGYVR MRGLGDPDVF LHGDLGVRMA LEAGGRRATP AAAAREAREW SPWRSYANHA
LWASLADRER ESTAVRADVV VRDGVRDASK ERQERKESA