Gene Ndas_1481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1481 
Symbol 
ID9245331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1814121 
End bp1815404 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content76% 
IMG OID 
Producttranscriptional regulator, XRE family 
Protein accessionYP_003679418 
Protein GI297560444 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.438769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.39369 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGTG CCCTGGCCAC CTGTGACATG GCGACGATCA TCGAGGGCGT GCGCGCGGCG 
CACGGCTGGT CCCAGGGCGA CCTCGCCCGC GCGATCGACT ACTCCCAGAG CTGGGTCTCG
CGCGTGGTCA ACGGCCAGCA GTCGCTGACC ATCAGCCAGG TACACGACCT CGCCCGACGC
CTGGGCATCC CGCTGCACAT GCTGCGTTTC GGCGGCACCG CGCCCCAGGC CCGTCCCGCC
GAGAAGGGGG TGGACACCAC GAGGCGCCGC GACTTCGGAC GCGCGGTGGC CGCGGGAGCC
CTGCTCACCG CCACCACCGT CACCACCGGT GCGGCTCCGG CCCCGGTGGC GTCGGACCCG
TACCCCGACA CCGTCAACGA GACGACCGCG CCCTCCCTGC GCGCCATCAC CGGGGGCCAG
CGCCGCATGG ACGCCACCTC CCCCTCCCGC CACCTGCTGC CCAGCGCTGT GGCGCACGTG
CATCTGGCCG AGCACATGCG TGGGCAGGCG CACGGGACGC CCTTCCACGG CGAGTTGAGC
GCCGCGGCCA GCGAGGCCTC CGGCTTCGCC GCCTGGCTGC ACGCGGACCG GGGGGACATG
GGGTCGGCCA GGGCGCACTA CCGCACCGCC GTGACGCGCG CGCGCCAGGC GGACATGCGC
CTGCTCGACG TGTACATGCT GGGCTCCCTG GCTGCCTTCG AGACCGACAC CGCCGAGGAC
CACCACCTGG GCCTGGGGCT GGTCCAGGAG GCCGAACACG TCCTGGGCCC CGCCGCCCAC
CCCACCGCGC GCGCCTGGCT GGCCTGCGTG GGGGCGCTCG CGCACGCCGG GAACGGGGAC
GGCGCTGCGG CCGCGCGCGC CCTGGGCCGG GCGGAGCGGG AGGTGGCCCG CTCCGCCAAC
ACCGACCCGC CCTGGCCGTG GGTGTTCGCC TTCGACGAGA CCAAGGTCGC CGGTTACCGC
GCCCGGGTGG GCGTGCGGCT GCGCCAGCCG CACGACGCCC GGCGGGCCTT CGCCGAGGCG
TTCGCGCCCA ACGGGGGCAA CCCCAAGCAG TCCGCGGTCC TCCAGGTGGA ACTGGCCTCG
GCGCACGCCG ACGCGGGGGA CGTGGACGAG GCCTTCCGCC TGGTCCACGA GGCGCTGAAC
ACGGGCGTGC GCTACGAGTC CGAGCGGATC ATCGGCCGGG TCCGGGCCTT TCGCCGCCGC
TGCTCGGGAA CGCGGGCGCG CTGCGTGGCC GACCTGGACG ACCGGCTCCT GTCCCTGGTG
AGGGGCGCCG CCGCCCAGGG GTAG
 
Protein sequence
MAGALATCDM ATIIEGVRAA HGWSQGDLAR AIDYSQSWVS RVVNGQQSLT ISQVHDLARR 
LGIPLHMLRF GGTAPQARPA EKGVDTTRRR DFGRAVAAGA LLTATTVTTG AAPAPVASDP
YPDTVNETTA PSLRAITGGQ RRMDATSPSR HLLPSAVAHV HLAEHMRGQA HGTPFHGELS
AAASEASGFA AWLHADRGDM GSARAHYRTA VTRARQADMR LLDVYMLGSL AAFETDTAED
HHLGLGLVQE AEHVLGPAAH PTARAWLACV GALAHAGNGD GAAAARALGR AEREVARSAN
TDPPWPWVFA FDETKVAGYR ARVGVRLRQP HDARRAFAEA FAPNGGNPKQ SAVLQVELAS
AHADAGDVDE AFRLVHEALN TGVRYESERI IGRVRAFRRR CSGTRARCVA DLDDRLLSLV
RGAAAQG