Gene Ndas_1071 details


Gene Information

Locus tag: Ndas_1071
Symbol:
ID: 9244917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1316982
End bp: 1318823
Gene Length: 1842 bp
Protein Length: 613 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: IucA/IucC family protein
Protein accession: YP_003679019
Protein GI: 297560045
COG category:
COG ID:
TIGRFAM ID:
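
The coordinate and length fields above are arithmetically linked, and the relationship can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1316982
end_bp = 1318823
gene_length_bp = 1842
protein_length_aa = 613


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS that ends in a stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(start_bp, end_bp) == gene_length_bp)
    print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold for the values listed: 1318823 - 1316982 + 1 = 1842 bp, and 1842 / 3 - 1 = 613 aa.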


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.549269
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.885774
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACGC CCACCACCGG ATTCGACGCC CGCCAGCCCC GCCTCCCCGA CCCCCGCGAC 
GCGGTCGCCC ACCTGGCCCC CGAGACGTGG GAGCGCGCCA ACCGGCTGCT CGTGCGCAAG
GCCCTGGCCG AGTTCGCCCA CGAACGGCTG ATCACCCCCG AACCCGGCCC CGACGGCCTC
TTCTCGGTCA CCAGCGACGA CGGCGGCGTC GAGTACCGGT TCGCGGCCCG CGTCATGGCC
CTGGAGCACT GGCGGATCGA GGCCGACAGC ATCGTCCGCC GCGACCTGCG GCGCGACGGC
GCGCACCTGC CCCTGGACGC CCTCGACCTG ATCCTGGACC TGCGCAAGAC GCTCACGCTC
GACGAGGACG TCCTGCCCGT CTACCTGGAG GAGATCACCA GCACCCTGGC CAGCAGCACC
TACCGCATGT CGGGCGCCCG ACCCGGCGCC GCCGAGCTGG CGCGCGCCGG GTTCCAGGAG
ATCGAGGCGG GGATGACCGA GGGCCACCCG TGCTTCGTCG CCAACAACGG GCGCCTGGGC
TTCGACGCCG CGGAGTACCG CGCCTACGCC CCCGAGGCCG CCGCCCCGGT GCGGCTGGTG
TGGGTCGCCG CGCGGCGCGA GCGCACGGTC TTCAGCTGCT CCGCCGACCT GGACCGCGAG
GAACTGCTGC GCGGTGAACT CGGCGCCCAG ACCCTGGCCG TCTTCGACGC GCGGCTGACC
GACATGGGCC TGGACCCCGC CGACTACCAC CTGATCCCGG TCCACCCCTG GCAGTGGTGG
AACCGGCTGG CGGTCACCTT CGCCGCCGAC GTCGCCCGCC GCGACCTGGT GTGCCTGGGC
CACGGCCCCG ACGAGTACCG CGCCCAGCAG TCCATCCGGA CCTTCTTCAA CACCTCGGCC
CCCGAACGCC ACTACGTCAA GACCGCCCTG TCGGTGCTGA ACATGGGCTT CCTGCGCGGG
CTGTCCGCCA AGTACATGGA GGCCACCCCG GCCATCAACG ACTGGGTCGC CTCGGTCGTG
GCCGACGACC CCGTCCTGCG CGCCACGCGC GTGGAGATCC TGCGCGAGCT GGCCGCGGTG
GGCTACCGCA CCACGCACTA CGCGCAGGCC TCGACGGAGA AGTCGCCCTA CCTGAAGATG
ACCGCCGCGC TGTGGCGCGA GAGCCCGGTG ACACGGCTGC GGTCCGGCGA GCGCCTGGCC
ACGATGGCCT CGCTGCTGCA CACCGACGCC CGGGGGAGGT CCCTCGCCGC GGAGCTCATC
GCCGAGTCGG GCCTGGAGCC CGCGGAGTGG CTGCGCCGCT ACCTCGACGC CTACCTCGTC
CCCCTGCTGC ACTGCTTCTA CGCCCACGAC CTGGTGTTCA TGCCGCACGG GGAGAACGTC
ATCCTCGTCC TGCGCGACGG GGTGCCCCAG CGGGTCCTGC TCAAGGACAT CGCCGAGGAG
ATCGCGGTGA TGAACGACGA CGCCGAACTG CCCCCCGGGG TCGAGCGGGT CCAGGGCGCG
GTGCCCGACG ACATGCGGGT GCTGTCGCTG TTCACCGACG TCTTCGACTG CTTCCTGCGC
TTCCTCAACG GCATCCTCGC CGGTGAGGGC GTCCTGTCCG AGACCGACTT CTGGCGCACC
GTGGCCGCCT GCGTCGCCGA CTACCAGGCG TCGGCGCCCC ACCTGGCCGA CCGTTTCGCG
CGCGACGACC TGTTCGCGGA ACGCTTCGCG CTCTCGTGCC TGAACCGGCT GCAACTGCGC
AACAACCGCC AGATGGTCGA TCTGGAGGAC CCGTCCGCGG GCCTGCAACT GGTCGGCACG
CTGGAGAACC CCATCGCCGG GTTCCGTCCC GCGCCCCGGT AG
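
The GC content field can in principle be recomputed from the gene sequence as listed. The sketch below is one way to do so, assuming the sequence is pasted as a string in the same layout as above (blocks of ten bases separated by spaces and line breaks). The function name `gc_fraction` is hypothetical, and how the record rounds its percentage is not stated.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace is ignored so that block-formatted sequence text
    can be passed in directly. Case is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First block-formatted line of the gene sequence, as an example input.
    first_line = "ATGACCACGC CCACCACCGG ATTCGACGCC CGCCAGCCCC GCCTCCCCGA CCCCCGCGAC"
    print(f"{gc_fraction(first_line):.2%}")
```

Passing the full gene sequence instead of the first line gives a value to compare with the GC content field.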
 
Protein sequence
MTTPTTGFDA RQPRLPDPRD AVAHLAPETW ERANRLLVRK ALAEFAHERL ITPEPGPDGL 
FSVTSDDGGV EYRFAARVMA LEHWRIEADS IVRRDLRRDG AHLPLDALDL ILDLRKTLTL
DEDVLPVYLE EITSTLASST YRMSGARPGA AELARAGFQE IEAGMTEGHP CFVANNGRLG
FDAAEYRAYA PEAAAPVRLV WVAARRERTV FSCSADLDRE ELLRGELGAQ TLAVFDARLT
DMGLDPADYH LIPVHPWQWW NRLAVTFAAD VARRDLVCLG HGPDEYRAQQ SIRTFFNTSA
PERHYVKTAL SVLNMGFLRG LSAKYMEATP AINDWVASVV ADDPVLRATR VEILRELAAV
GYRTTHYAQA STEKSPYLKM TAALWRESPV TRLRSGERLA TMASLLHTDA RGRSLAAELI
AESGLEPAEW LRRYLDAYLV PLLHCFYAHD LVFMPHGENV ILVLRDGVPQ RVLLKDIAEE
IAVMNDDAEL PPGVERVQGA VPDDMRVLSL FTDVFDCFLR FLNGILAGEG VLSETDFWRT
VAACVADYQA SAPHLADRFA RDDLFAERFA LSCLNRLQLR NNRQMVDLED PSAGLQLVGT
LENPIAGFRP APR