Gene Ndas_2237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2237 
Symbol 
ID9246087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2677018 
End bp2678340 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content70% 
IMG OID 
Productputative transcriptional regulator, GntR family 
Protein accessionYP_003680165 
Protein GI297561191 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0384062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.194146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGCTG ACCACCGAGA GCAGCAGCCC AGCCAACAGG GTTCGCGTAT CGACCCGCAC 
GTCGGCCGCT ACGCACGCCG AGCACAGGGC ATGGTCGCAT CGGAGGTCCG CGCACTCTTC
GCCGTCGCCT CGCGCCCCGA GGTGGTCTCC CTCGCCGGGG GCATGCCCAA CGTCGCGGCT
CTGCCCCTGG ACCAGATCGG CGAACTGGTC AAGGACGTCG TCTCCGAGGA GGGCGCGGCG
GCCCTCCAGT ACGGTTCCGC CCAGGGCGAC CCCGTGCTGC GCGAGCAGCT CTGCGACTAC
ATGACGCTGG AGGGCATCAC CGCCAGCCCC GACGACGTCA TCGTCACCGT CGGCTCCCAG
CAGGCCCTGG ACCTGATCAC CCGCGTCTTC GTCGACCCGG GCGACATCGT GCTGACCGAG
GCCCCCACCT ACGTCACCGC GATCAACACC TTCGCCGCCT TCCAGGCCGA CATCCGCCAC
GTCGGCATGG ACGAGCAGGG CGTGATCCCC GAGGAGCTGG AGGAGGCGCT CGTCCGCGCC
GAGCTGGACG GGCGGCCGGT GAAGTTCTTC TACACCGTCC CCAACTTCCA GAACCCCGCC
GGGATCACCA TGACGGCCGA GCGGCGCGCG CGGGTGACCG AGGCCTGCGA ACGCCACGAC
GTGCTCATCG TCGAGGACAA CCCCTACGGC CTGCTGCGCT ACGACGGCGA CCCCGAGCCC
ACCCTCTACT CCCAGAGCGA GGGCAACGTC ATCTACCTGG GATCGCTGTC CAAGACCCTC
TCCCCGGGCC TGCGCATCGG CTGGGCGCTC GCCCCCGCCG CGGTGCGCGC CAAGCTCGTC
CTGGCCGCCG AGTCGGCGAT GCTCAGCCAC TCCACCTTCA ACCAGCTGGT GGTGCGCCGC
TACCTCAACA CCTTCCCCTG GCGCGAGCAG ATCAAGTCGT TCAACACCAT GTACGGCGAA
CGCCGCGACG CGATGCTCAA CGCGCTGACC GCGATGATGC CCGCGGGCTG CACCTGGACC
CGCCCCCAGG GCGGATTCTT CGTCTGGGCC ACCCTGCCCG AGGGCATCGA CTCCAAGGCC
ATGCTGCCCC GGGCCGTCAC CGAACGGGTC GCCTACGTGC CCGGCACCGG CTTCTACGCC
GACGGACGCG GCCGGGCCAA CATGCGCCTG AGCTTCTGCT ACCCCACACC CGAGCAGATC
CGCGAGGGAG TGCGCCGCCT CGTCGGAGCC ATCGAGGGTG AGATCGACCT GCGTGATACG
TTCGGCACCA CCCTGGCCCC GAGCGCAGAC GGCCCACAGG CCCCCGCTCC GGACCTGCCC
TGA
 
Protein sequence
MSADHREQQP SQQGSRIDPH VGRYARRAQG MVASEVRALF AVASRPEVVS LAGGMPNVAA 
LPLDQIGELV KDVVSEEGAA ALQYGSAQGD PVLREQLCDY MTLEGITASP DDVIVTVGSQ
QALDLITRVF VDPGDIVLTE APTYVTAINT FAAFQADIRH VGMDEQGVIP EELEEALVRA
ELDGRPVKFF YTVPNFQNPA GITMTAERRA RVTEACERHD VLIVEDNPYG LLRYDGDPEP
TLYSQSEGNV IYLGSLSKTL SPGLRIGWAL APAAVRAKLV LAAESAMLSH STFNQLVVRR
YLNTFPWREQ IKSFNTMYGE RRDAMLNALT AMMPAGCTWT RPQGGFFVWA TLPEGIDSKA
MLPRAVTERV AYVPGTGFYA DGRGRANMRL SFCYPTPEQI REGVRRLVGA IEGEIDLRDT
FGTTLAPSAD GPQAPAPDLP