Gene Ndas_2412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2412 
Symbol 
ID9246262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2862859 
End bp2863971 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content75% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003680339 
Protein GI297561365 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0744334 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGGTT CTGTCGAGGT TCGCTCCGAA GGGCGTCCGG TCAGCACGGG AGGGCCCAAA 
AGGCGTACGG TCCTGGCCGC ACTCCTCCTG CAACCCGGCA CCGTGGTCTC CGACAACCGG
CTCATCGACC TGGTCTGGGG CGAACACCCT CCGCGCAGCG CACGGTCCCA GCTCCAGGCG
CACGTGCACG AACTCCGCAA GGTCCTGGGC GCGGACACCA TCGTCCGGAG CACCTGCGGC
TACCGGATGA CCGTGGCGGC CGAGGCCACG GACAGCGGCG TGTTCGAGCG GCTGCTGGTG
CGCGCGCACT CCGAGCGCGC CGCCCGCCGC CCCTCCGATG CGGTCCGGAC CCTGCGCACC
GCGCTGTCGC TGTGGACGGG CCCGGCGCTG GGCGGCGTGA CACCGGCGCT CGCCGCCCAC
GCACGCCCGG CCCTGGAGGA GAGGCGGCTG CACGCCCTGG TGGAGCTGCA CGGCGCCGGG
ATCGACCTCG GCCGCGGAGC CGCCGCCGTC CCCGAACTGC TCTCGCTGTG CGCGGAGCAC
CCGACCCACG AACGCTTCGC GGGCCTGCTG ATGTCGGCCC TGCACGCCTG CGGACGCACC
GGGGAGGCGC TGGAGGTCTA CGCGAAGCTG CGCGAGAGGC TGGCCGACGA ACTGGGCACC
GGCCCCGGCG CCCGGCTCCG GCAGCTGCAC CTGGACCTGC TCACGGCCGG GCCGGAGGGC
GGGAGCACGC GCGGCGGCGC GCCCGTCGCC CACCGGGTCC GCCCCGCCGA ACTGCCCTAC
GGCGTCGGTG AGTTCGTCGG CCGCTCGGCG GAGCTGTCCG TGCTCGACCA GGCGCTGGAC
GACGACCGGG GGGCCGACCG CTGGCCCGCG GTGTTCCTCC TCACCGGGGT CTCCGGCGTC
GGCAAGACCG CGCTGGCCCT GCACTGGAGC CATGCCGTCC GGGAGCGCTT CCAGGATGGC
CAGCTCTACG TGGACCTGCG CGGTTCCTCC TCCGGGGGGG GGAGCCCGTC CGGGCCGAGG
ACGCGCTGCG CCAGCTGCTG CGCGGACTGG GCGCGGACCC CGGAGGCCTG CCGACCGGCG
CCGACGAGCT CGCCAAGCTC TTCCGGTCGG TGA
 
Protein sequence
MLGSVEVRSE GRPVSTGGPK RRTVLAALLL QPGTVVSDNR LIDLVWGEHP PRSARSQLQA 
HVHELRKVLG ADTIVRSTCG YRMTVAAEAT DSGVFERLLV RAHSERAARR PSDAVRTLRT
ALSLWTGPAL GGVTPALAAH ARPALEERRL HALVELHGAG IDLGRGAAAV PELLSLCAEH
PTHERFAGLL MSALHACGRT GEALEVYAKL RERLADELGT GPGARLRQLH LDLLTAGPEG
GSTRGGAPVA HRVRPAELPY GVGEFVGRSA ELSVLDQALD DDRGADRWPA VFLLTGVSGV
GKTALALHWS HAVRERFQDG QLYVDLRGSS SGGGSPSGPR TRCASCCADW ARTPEACRPA
PTSSPSSSGR