Gene Ndas_0666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0666 
Symbol 
ID9244508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp817659 
End bp818795 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content70% 
IMG OID 
ProductrecA protein 
Protein accessionYP_003678617 
Protein GI297559643 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.386455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.202905 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCATCTG GAGACCGAGA CAAGGCTCTC GAAACAGCGC TCGCCCAGAT CGAGCGGCAG 
TTCGGCAAGG GCTCCATCAT GCGCCTGGGC GACGACGACC GGCCGCCGGT GGAGTCGATC
CCCACCGGGG CGATCGCGCT CGACGTGGCG CTCGGCATCG GAGGCCTGCC CCGGGGCCGC
GTCGTGGAGA TCTACGGCCC CGAGTCCAGC GGTAAGACCA CCGTCGCCCT GCACGCGGTG
GCCAGCGCCC AGCGGATGGG CGGCATCGCG GCCTTCGTCG ACGCCGAGCA CGCGCTCGAC
CCCGAGTACG CCAAGAAGAT CGGCGTCAAC ACCGACGACC TGCTGCTCTC GCAGCCGGAC
ACCGGTGAGC AGGCGCTGGA GATCGTCGAC ATGCTCATCC GCTCCGGAGC GGTCTCCATC
ATCGTCATCG ACTCCGTGGC GGCCCTGGTG CCCCGCGCCG AGATCGAGGG CGAGATGGGC
GACAGCCACG TCGGACTCCA GGCCCGCCTG ATGTCCCAGG CGCTGCGCAA GATCGCCGGT
GCGCTCCACC AGACCAACAC CACCGCGATC TTCATCAACC AGCTGCGCGA GAAGGTCGGC
GTCATGTTCG GCTCGCCCGA GACGACGACC GGCGGCAAGG CGCTCAAGTT CTACGCCTCG
GTGCGCCTGG ACGTGCGCCG CATCGAGACG CTCAAGGACG GCACCGACGC GGTCGGCAAC
CGCACCCGCG TCAAGGTCGT CAAGAACAAG GTCGCGCCGC CCTTCAAGCA GGCCGAGTTC
GACATCCTCT ACGGGGTGGG CGTCTCGCGC GAGGGCAGCC TCATCGACCT GGGCGTGGAG
CACGGCATCG TCCGCAAGTC GGGCGCCTGG TACACCTACG AGGGCACCCA GCTGGGCCAG
GGCAAGGAGA ACGCGCGCAA CTTCCTGCGC GAGAACGCCG ACATGGCCAA CGAGGTCGAG
AAGAAGATCA AGGAGAAGCT GGGCGTGCCC GTCAAGGGCG ACGACAGCGC CTCCGGCCCG
GCCGCCGAAC CGGCCAAGGC CGCCGCTGAG GCGGCTGCGG ACCCGGCCGC CGCGGCCAAG
GCACCCGCCA AGCGCGCCGC GGCCAGGACC CCCAAGGCGC CCGCGGCTGA TGCGTGA
 
Protein sequence
MASGDRDKAL ETALAQIERQ FGKGSIMRLG DDDRPPVESI PTGAIALDVA LGIGGLPRGR 
VVEIYGPESS GKTTVALHAV ASAQRMGGIA AFVDAEHALD PEYAKKIGVN TDDLLLSQPD
TGEQALEIVD MLIRSGAVSI IVIDSVAALV PRAEIEGEMG DSHVGLQARL MSQALRKIAG
ALHQTNTTAI FINQLREKVG VMFGSPETTT GGKALKFYAS VRLDVRRIET LKDGTDAVGN
RTRVKVVKNK VAPPFKQAEF DILYGVGVSR EGSLIDLGVE HGIVRKSGAW YTYEGTQLGQ
GKENARNFLR ENADMANEVE KKIKEKLGVP VKGDDSASGP AAEPAKAAAE AAADPAAAAK
APAKRAAART PKAPAADA