Gene Ndas_1646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1646 
Symbol 
ID9245496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2018817 
End bp2020028 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content72% 
IMG OID 
Producttranscriptional regulator, XRE family 
Protein accessionYP_003679581 
Protein GI297560607 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.24387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGCAG CCACCAGAGG CCAGATCATC CGCTCGCACC GCCGACGACG CGGCTACAGC 
CAGACCGTAC TCGCTGGTCT CGTGGGCCGC TCCGAGTCCT GGCTGTCCCA GGTCGAACGC
GGCAAGCTCC CGGTCGACAG CCACGAGGTG CTGTCGAGGC TCGCGGACGT GCTCCGCCTC
CCACTCGACG AGCTCACCGG AACCACCGAA GAGACCATCC CTGTGCGCTA TGCCCCTGCC
GACGCCATCG AACGCGCCAT GATGCGGTAC ACCTCCCTGG AGGTGATCGT CGCTGAGACC
GGCGGCACCC CTCAGGCTGT TGATGTGGAA CGCCTGCGCG CCGAGGCCCA CCACACCTAC
GCCGCCTACC AGGCCACCCG GTACGCCGAA GTAGGCCGCC GCCTCCCCCG CCTCATCCGC
GATGTCGAGG CAGCCGCGCG CTCACGCGGC GCAGACCGCC CAGCCGTGTG CTCGGCCAGG
GCGATGGTCT ACAACACGGC CGCCGCCGTC CTGCGGCGTG TCGGCGCGAA GGACCTGGCA
TGGCAGGCCG CCGACCGGGC CATGTCCGCG TCCGAGTGGG CCGACGAGAC CCTGCTGGCC
GCCGTCGGCG CCTACCGGCT GTCCTACGTT TTCATCAGCC GTGGCAACCC CGACGTGGCT
GCGGAGCTCG CGATGGGAGC GGCGCACGCC CTGGAACGGC GGATGCGCCC CGGCACCCCG
GAAGAGCTGT CGGTGTACGG GGGGCTGCAC CTGGCGGCTG CGACGGCCGC CGCGGCCGAG
TACGACCGGG CTGCGGTCCC CCGGTTCCTG GCCCAGGCCC AGCGGGTCGC CGACCGGCTG
GGCCAGGACC TGAACTTGCA CGGAACGGCG TTCGGGCCAA CGAACGTCGC CATCCACACC
ATCAGCACCA GCGTCAAGAC CGGAGACGCG AAGACCGCGG TCGCCGCAGG AGAGACCCTC
GCCGTCGAGC ACCTGCCCGC CGGGCTCGTC GGCCGCCGTG CCCAGGTGCA CCTGGACGTG
GCGTGCGCCT ACGCCCAGAC CCGTCAGGAC GCCGCTGCCG TCAACACGTT GTTGGAGGCG
GAGCGGATCG CCCCGGAGCT GGTGCGGCAC GACCCGGCGA CAGGGAGGGT GCTGACAGAG
CTGCTGCGCC GAGAACACCG CCGATCCACC CCTGAGCTGC GGCCGTTGGC CCAGCGCGCC
GGGGTCAGCT GA
 
Protein sequence
MDAATRGQII RSHRRRRGYS QTVLAGLVGR SESWLSQVER GKLPVDSHEV LSRLADVLRL 
PLDELTGTTE ETIPVRYAPA DAIERAMMRY TSLEVIVAET GGTPQAVDVE RLRAEAHHTY
AAYQATRYAE VGRRLPRLIR DVEAAARSRG ADRPAVCSAR AMVYNTAAAV LRRVGAKDLA
WQAADRAMSA SEWADETLLA AVGAYRLSYV FISRGNPDVA AELAMGAAHA LERRMRPGTP
EELSVYGGLH LAAATAAAAE YDRAAVPRFL AQAQRVADRL GQDLNLHGTA FGPTNVAIHT
ISTSVKTGDA KTAVAAGETL AVEHLPAGLV GRRAQVHLDV ACAYAQTRQD AAAVNTLLEA
ERIAPELVRH DPATGRVLTE LLRREHRRST PELRPLAQRA GVS